Warning: Permanently added '3.89.206.8' (ED25519) to the list of known hosts. Running (timeout=172800): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-tijfnks1/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-tijfnks1/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1746813413.875217 -r /var/lib/copr-rpmbuild/results/configs/child.cfg INFO: mock.py version 6.1 starting (python version = 3.13.0, NVR = mock-6.1-1.fc41), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-tijfnks1/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-tijfnks1/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1746813413.875217 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start(bootstrap): init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish(bootstrap): init plugins Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-tijfnks1/rccl/rccl.spec) Config(centos-stream+epel-10-x86_64) Start: clean chroot Finish: clean chroot Mock Version: 6.1 INFO: Mock Version: 6.1 Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/centos-stream+epel-10-x86_64-bootstrap-1746813413.875217/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata INFO: Guessed host environment type: unknown INFO: Using container image: quay.io/centos/centos:stream10 INFO: Pulling image: quay.io/centos/centos:stream10 INFO: Tagging container image as mock-bootstrap-27a9ac8d-2fc7-4a47-afa9-decf4e53c7b4 INFO: Checking that 7ace1fec11fa7fdd81c73f3fcfa2bff6b64822e93db282f33efc55b40ce24f9f image matches host's architecture INFO: Copy content of container 7ace1fec11fa7fdd81c73f3fcfa2bff6b64822e93db282f33efc55b40ce24f9f to /var/lib/mock/centos-stream+epel-10-x86_64-bootstrap-1746813413.875217/root INFO: mounting 7ace1fec11fa7fdd81c73f3fcfa2bff6b64822e93db282f33efc55b40ce24f9f with podman image mount INFO: image 7ace1fec11fa7fdd81c73f3fcfa2bff6b64822e93db282f33efc55b40ce24f9f as /var/lib/containers/storage/overlay/1133dd70d39278c6b7773fa847083576b7f561659047fcfb08aad24c7a02662c/merged INFO: umounting image 7ace1fec11fa7fdd81c73f3fcfa2bff6b64822e93db282f33efc55b40ce24f9f (/var/lib/containers/storage/overlay/1133dd70d39278c6b7773fa847083576b7f561659047fcfb08aad24c7a02662c/merged) with podman image umount INFO: Removing image mock-bootstrap-27a9ac8d-2fc7-4a47-afa9-decf4e53c7b4 INFO: Package manager dnf4 detected and used (fallback) INFO: Not updating bootstrap chroot, bootstrap_image_ready=True Start(bootstrap): creating root cache Finish(bootstrap): creating root cache Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/centos-stream+epel-10-x86_64-1746813413.875217/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf4 detected and used (direct choice) INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-4.19.1.1-14.el10.x86_64 rpm-sequoia-1.6.0-6.el10.x86_64 python3-dnf-4.20.0-13.el10.noarch python3-dnf-plugins-core-4.7.0-9.el10.noarch yum-4.20.0-13.el10.noarch Start: installing minimal buildroot with dnf No matches found for the following disable plugin patterns: local, spacewalk, versionlock Copr repository 5.0 kB/s | 2.2 kB 00:00 Additional repo https_download_copr_fedorainfra 1.9 MB/s | 61 kB 00:00 CentOS Stream 10 - BaseOS 57 MB/s | 6.6 MB 00:00 CentOS Stream 10 - AppStream 24 MB/s | 3.2 MB 00:00 CentOS Stream 10 - CRB 8.9 MB/s | 724 kB 00:00 CentOS Stream 10 - Extras packages 91 kB/s | 4.7 kB 00:00 Extra Packages for Enterprise Linux 10 - x86_64 50 MB/s | 5.0 MB 00:00 Dependencies resolved. ================================================================================ Package Arch Version Repo Size ================================================================================ Installing: bash x86_64 5.2.26-6.el10 baseos 1.8 M bzip2 x86_64 1.0.8-25.el10 baseos 54 k centos-stream-release noarch 10.0-8.el10 baseos 29 k coreutils x86_64 9.5-6.el10 baseos 1.1 M cpio x86_64 2.15-3.el10 baseos 292 k diffutils x86_64 3.10-8.el10 baseos 409 k epel-rpm-macros noarch 10-4.el10_1 epel 7.9 k findutils x86_64 1:4.10.0-5.el10 baseos 551 k gawk x86_64 5.3.0-6.el10 baseos 1.1 M glibc-minimal-langpack x86_64 2.39-37.el10 baseos 81 k grep x86_64 3.11-10.el10 baseos 301 k gzip x86_64 1.13-3.el10 baseos 170 k info x86_64 7.1-6.el10 baseos 183 k patch x86_64 2.7.6-26.el10 appstream 132 k redhat-rpm-config noarch 289-1.el10 appstream 78 k rpm-build x86_64 4.19.1.1-14.el10 appstream 77 k sed x86_64 4.9-3.el10 baseos 318 k shadow-utils x86_64 2:4.15.0-5.el10 baseos 1.3 M tar x86_64 2:1.35-7.el10 baseos 863 k unzip x86_64 6.0-66.el10 baseos 189 k util-linux x86_64 2.40.2-10.el10 baseos 1.2 M which x86_64 2.21-43.el10 baseos 42 k xz x86_64 1:5.6.2-3.el10 baseos 482 k Installing dependencies: alternatives x86_64 1.30-2.el10 baseos 42 k ansible-srpm-macros noarch 1-16.1.el10_0 epel 20 k audit-libs x86_64 4.0.3-4.el10 baseos 133 k authselect x86_64 1.5.0-8.el10 baseos 144 k authselect-libs x86_64 1.5.0-8.el10 baseos 223 k basesystem noarch 11-22.el10 baseos 4.3 k binutils x86_64 2.41-53.el10 baseos 6.4 M binutils-gold x86_64 2.41-53.el10 baseos 798 k bzip2-libs x86_64 1.0.8-25.el10 baseos 42 k ca-certificates noarch 2024.2.69_v8.0.303-102.3.el10 baseos 1.1 M centos-gpg-keys noarch 10.0-8.el10 baseos 14 k centos-stream-repos noarch 10.0-8.el10 baseos 12 k coreutils-common x86_64 9.5-6.el10 baseos 2.1 M cracklib x86_64 2.9.11-8.el10 baseos 96 k cracklib-dicts x86_64 2.9.11-8.el10 baseos 3.6 M crypto-policies noarch 20250424-2.git9267dee.el10 baseos 97 k curl x86_64 8.12.1-2.el10 baseos 219 k cyrus-sasl-lib x86_64 2.1.28-27.el10 baseos 106 k debugedit x86_64 5.1-7.el10 appstream 80 k dwz x86_64 0.15-7.el10 appstream 139 k ed x86_64 1.20-5.el10 baseos 83 k efi-srpm-macros noarch 6-6.el10 appstream 24 k elfutils x86_64 0.193-1.el10 baseos 573 k elfutils-debuginfod-client x86_64 0.193-1.el10 baseos 47 k elfutils-default-yama-scope noarch 0.193-1.el10 baseos 13 k elfutils-libelf x86_64 0.193-1.el10 baseos 208 k elfutils-libs x86_64 0.193-1.el10 baseos 269 k file x86_64 5.45-7.el10 baseos 50 k file-libs x86_64 5.45-7.el10 baseos 763 k filesystem x86_64 3.18-16.el10 baseos 4.7 M fonts-srpm-macros noarch 1:2.0.5-18.el10 appstream 27 k forge-srpm-macros noarch 0.4.0-6.el10 appstream 21 k gdb-minimal x86_64 14.2-4.el10 appstream 3.9 M gdbm x86_64 1:1.23-11.el10 baseos 151 k gdbm-libs x86_64 1:1.23-11.el10 baseos 57 k ghc-srpm-macros noarch 1.9.2-1.el10_0 epel 9.1 k glibc x86_64 2.39-37.el10 baseos 2.1 M glibc-common x86_64 2.39-37.el10 baseos 376 k glibc-gconv-extra x86_64 2.39-37.el10 baseos 1.7 M gmp x86_64 1:6.2.1-10.el10 baseos 317 k go-srpm-macros noarch 3.6.0-4.el10 appstream 28 k jansson x86_64 2.14-3.el10 baseos 46 k json-c x86_64 0.18-3.el10 baseos 45 k kernel-srpm-macros noarch 1.0-25.el10 appstream 11 k keyutils-libs x86_64 1.6.3-5.el10 baseos 33 k krb5-libs x86_64 1.21.3-7.el10 baseos 767 k libacl x86_64 2.3.2-4.el10 baseos 25 k libarchive x86_64 3.7.7-1.el10 baseos 416 k libattr x86_64 2.5.2-5.el10 baseos 19 k libblkid x86_64 2.40.2-10.el10 baseos 127 k libbrotli x86_64 1.1.0-6.el10 baseos 346 k libcap x86_64 2.69-7.el10 baseos 91 k libcap-ng x86_64 0.8.4-6.el10 baseos 33 k libcom_err x86_64 1.47.1-3.el10 baseos 27 k libcurl x86_64 8.12.1-2.el10 baseos 371 k libeconf x86_64 0.6.2-4.el10 baseos 33 k libevent x86_64 2.1.12-16.el10 baseos 261 k libfdisk x86_64 2.40.2-10.el10 baseos 161 k libffi x86_64 3.4.4-9.el10 baseos 41 k libgcc x86_64 14.2.1-7.el10 baseos 135 k libgomp x86_64 14.2.1-7.el10 baseos 352 k libidn2 x86_64 2.3.7-3.el10 baseos 118 k libmount x86_64 2.40.2-10.el10 baseos 157 k libnghttp2 x86_64 1.64.0-2.el10 baseos 78 k libpkgconf x86_64 2.1.0-3.el10 baseos 39 k libpsl x86_64 0.21.5-6.el10 baseos 65 k libpwquality x86_64 1.4.5-12.el10 baseos 123 k libselinux x86_64 3.8-1.el10 baseos 98 k libsemanage x86_64 3.8.1-1.el10 baseos 122 k libsepol x86_64 3.8-1.el10 baseos 348 k libsmartcols x86_64 2.40.2-10.el10 baseos 85 k libssh x86_64 0.11.1-1.el10 baseos 234 k libssh-config noarch 0.11.1-1.el10 baseos 9.7 k libstdc++ x86_64 14.2.1-7.el10 baseos 847 k libtasn1 x86_64 4.20.0-1.el10 baseos 75 k libunistring x86_64 1.1-10.el10 baseos 548 k libutempter x86_64 1.2.1-15.el10 baseos 27 k libuuid x86_64 2.40.2-10.el10 baseos 30 k libverto x86_64 0.3.2-10.el10 baseos 22 k libxcrypt x86_64 4.4.36-10.el10 baseos 120 k libxml2 x86_64 2.12.5-5.el10 baseos 692 k libzstd x86_64 1.5.5-9.el10 baseos 292 k lua-libs x86_64 5.4.6-7.el10 baseos 132 k lua-srpm-macros noarch 1-15.el10 appstream 9.8 k lz4-libs x86_64 1.9.4-8.el10 baseos 68 k mpfr x86_64 4.2.1-5.el10 baseos 346 k ncurses-base noarch 6.4-14.20240127.el10 baseos 100 k ncurses-libs x86_64 6.4-14.20240127.el10 baseos 338 k ocaml-srpm-macros noarch 10-4.el10 appstream 10 k openblas-srpm-macros noarch 2-19.el10 appstream 8.7 k openldap x86_64 2.6.8-3.el10 baseos 240 k openssl-libs x86_64 1:3.5.0-2.el10 baseos 2.6 M p11-kit x86_64 0.25.5-7.el10 baseos 497 k p11-kit-trust x86_64 0.25.5-7.el10 baseos 134 k package-notes-srpm-macros noarch 0.5-13.el10 appstream 10 k pam x86_64 1.6.1-7.el10 baseos 585 k pam-libs x86_64 1.6.1-7.el10 baseos 58 k pcre2 x86_64 10.44-1.el10.3 baseos 248 k pcre2-syntax noarch 10.44-1.el10.3 baseos 151 k perl-srpm-macros noarch 1-57.el10 appstream 9.5 k pkgconf x86_64 2.1.0-3.el10 baseos 45 k pkgconf-m4 noarch 2.1.0-3.el10 baseos 15 k pkgconf-pkg-config x86_64 2.1.0-3.el10 baseos 11 k popt x86_64 1.19-8.el10 baseos 66 k publicsuffix-list-dafsa noarch 20240107-5.el10 baseos 59 k pyproject-srpm-macros noarch 1.16.2-1.el10 appstream 15 k python-srpm-macros noarch 3.12-10.el10 appstream 24 k qt6-srpm-macros noarch 6.8.1-4.el10 appstream 11 k readline x86_64 8.2-11.el10 baseos 214 k rpm x86_64 4.19.1.1-14.el10 baseos 566 k rpm-build-libs x86_64 4.19.1.1-14.el10 baseos 95 k rpm-libs x86_64 4.19.1.1-14.el10 baseos 311 k rpm-sequoia x86_64 1.6.0-6.el10 baseos 874 k rust-toolset-srpm-macros noarch 1.85.0-2.el10 appstream 13 k setup noarch 2.14.5-4.el10 baseos 154 k sqlite-libs x86_64 3.46.1-3.el10 baseos 747 k systemd-libs x86_64 257-11.el10 baseos 819 k util-linux-core x86_64 2.40.2-10.el10 baseos 548 k xz-libs x86_64 1:5.6.2-3.el10 baseos 113 k zip x86_64 3.0-44.el10 baseos 270 k zlib-ng-compat x86_64 2.2.3-2.el10 baseos 79 k zstd x86_64 1.5.5-9.el10 baseos 464 k Transaction Summary ================================================================================ Install 145 Packages Total download size: 60 M Installed size: 186 M Downloading Packages: (1/145): alternatives-1.30-2.el10.x86_64.rpm 2.6 MB/s | 42 kB 00:00 (2/145): authselect-1.5.0-8.el10.x86_64.rpm 8.4 MB/s | 144 kB 00:00 (3/145): audit-libs-4.0.3-4.el10.x86_64.rpm 7.1 MB/s | 133 kB 00:00 (4/145): basesystem-11-22.el10.noarch.rpm 1.5 MB/s | 4.3 kB 00:00 (5/145): authselect-libs-1.5.0-8.el10.x86_64.rp 30 MB/s | 223 kB 00:00 (6/145): bash-5.2.26-6.el10.x86_64.rpm 164 MB/s | 1.8 MB 00:00 (7/145): binutils-gold-2.41-53.el10.x86_64.rpm 80 MB/s | 798 kB 00:00 (8/145): bzip2-1.0.8-25.el10.x86_64.rpm 9.2 MB/s | 54 kB 00:00 (9/145): bzip2-libs-1.0.8-25.el10.x86_64.rpm 10 MB/s | 42 kB 00:00 (10/145): centos-gpg-keys-10.0-8.el10.noarch.rp 3.5 MB/s | 14 kB 00:00 (11/145): binutils-2.41-53.el10.x86_64.rpm 230 MB/s | 6.4 MB 00:00 (12/145): ca-certificates-2024.2.69_v8.0.303-10 77 MB/s | 1.1 MB 00:00 (13/145): centos-stream-release-10.0-8.el10.noa 3.4 MB/s | 29 kB 00:00 (14/145): centos-stream-repos-10.0-8.el10.noarc 3.6 MB/s | 12 kB 00:00 (15/145): cpio-2.15-3.el10.x86_64.rpm 56 MB/s | 292 kB 00:00 (16/145): coreutils-9.5-6.el10.x86_64.rpm 127 MB/s | 1.1 MB 00:00 (17/145): coreutils-common-9.5-6.el10.x86_64.rp 173 MB/s | 2.1 MB 00:00 (18/145): cracklib-2.9.11-8.el10.x86_64.rpm 16 MB/s | 96 kB 00:00 (19/145): crypto-policies-20250424-2.git9267dee 16 MB/s | 97 kB 00:00 (20/145): curl-8.12.1-2.el10.x86_64.rpm 35 MB/s | 219 kB 00:00 (21/145): cyrus-sasl-lib-2.1.28-27.el10.x86_64. 24 MB/s | 106 kB 00:00 (22/145): diffutils-3.10-8.el10.x86_64.rpm 78 MB/s | 409 kB 00:00 (23/145): cracklib-dicts-2.9.11-8.el10.x86_64.r 186 MB/s | 3.6 MB 00:00 (24/145): ed-1.20-5.el10.x86_64.rpm 12 MB/s | 83 kB 00:00 (25/145): elfutils-0.193-1.el10.x86_64.rpm 82 MB/s | 573 kB 00:00 (26/145): elfutils-debuginfod-client-0.193-1.el 14 MB/s | 47 kB 00:00 (27/145): elfutils-default-yama-scope-0.193-1.e 4.8 MB/s | 13 kB 00:00 (28/145): elfutils-libelf-0.193-1.el10.x86_64.r 51 MB/s | 208 kB 00:00 (29/145): file-5.45-7.el10.x86_64.rpm 16 MB/s | 50 kB 00:00 (30/145): elfutils-libs-0.193-1.el10.x86_64.rpm 44 MB/s | 269 kB 00:00 (31/145): file-libs-5.45-7.el10.x86_64.rpm 129 MB/s | 763 kB 00:00 (32/145): findutils-4.10.0-5.el10.x86_64.rpm 70 MB/s | 551 kB 00:00 (33/145): gawk-5.3.0-6.el10.x86_64.rpm 120 MB/s | 1.1 MB 00:00 (34/145): filesystem-3.18-16.el10.x86_64.rpm 230 MB/s | 4.7 MB 00:00 (35/145): gdbm-1.23-11.el10.x86_64.rpm 13 MB/s | 151 kB 00:00 (36/145): gdbm-libs-1.23-11.el10.x86_64.rpm 8.7 MB/s | 57 kB 00:00 (37/145): glibc-common-2.39-37.el10.x86_64.rpm 86 MB/s | 376 kB 00:00 (38/145): glibc-gconv-extra-2.39-37.el10.x86_64 159 MB/s | 1.7 MB 00:00 (39/145): glibc-2.39-37.el10.x86_64.rpm 151 MB/s | 2.1 MB 00:00 (40/145): glibc-minimal-langpack-2.39-37.el10.x 8.1 MB/s | 81 kB 00:00 (41/145): gmp-6.2.1-10.el10.x86_64.rpm 63 MB/s | 317 kB 00:00 (42/145): grep-3.11-10.el10.x86_64.rpm 67 MB/s | 301 kB 00:00 (43/145): gzip-1.13-3.el10.x86_64.rpm 37 MB/s | 170 kB 00:00 (44/145): info-7.1-6.el10.x86_64.rpm 40 MB/s | 183 kB 00:00 (45/145): jansson-2.14-3.el10.x86_64.rpm 17 MB/s | 46 kB 00:00 (46/145): json-c-0.18-3.el10.x86_64.rpm 17 MB/s | 45 kB 00:00 (47/145): keyutils-libs-1.6.3-5.el10.x86_64.rpm 13 MB/s | 33 kB 00:00 (48/145): libacl-2.3.2-4.el10.x86_64.rpm 11 MB/s | 25 kB 00:00 (49/145): krb5-libs-1.21.3-7.el10.x86_64.rpm 137 MB/s | 767 kB 00:00 (50/145): libattr-2.5.2-5.el10.x86_64.rpm 5.2 MB/s | 19 kB 00:00 (51/145): libarchive-3.7.7-1.el10.x86_64.rpm 72 MB/s | 416 kB 00:00 (52/145): libblkid-2.40.2-10.el10.x86_64.rpm 32 MB/s | 127 kB 00:00 (53/145): libbrotli-1.1.0-6.el10.x86_64.rpm 77 MB/s | 346 kB 00:00 (54/145): libcap-ng-0.8.4-6.el10.x86_64.rpm 15 MB/s | 33 kB 00:00 (55/145): libcap-2.69-7.el10.x86_64.rpm 22 MB/s | 91 kB 00:00 (56/145): libcom_err-1.47.1-3.el10.x86_64.rpm 9.4 MB/s | 27 kB 00:00 (57/145): libeconf-0.6.2-4.el10.x86_64.rpm 12 MB/s | 33 kB 00:00 (58/145): libcurl-8.12.1-2.el10.x86_64.rpm 83 MB/s | 371 kB 00:00 (59/145): libevent-2.1.12-16.el10.x86_64.rpm 59 MB/s | 261 kB 00:00 (60/145): libfdisk-2.40.2-10.el10.x86_64.rpm 45 MB/s | 161 kB 00:00 (61/145): libffi-3.4.4-9.el10.x86_64.rpm 14 MB/s | 41 kB 00:00 (62/145): libgcc-14.2.1-7.el10.x86_64.rpm 41 MB/s | 135 kB 00:00 (63/145): libgomp-14.2.1-7.el10.x86_64.rpm 91 MB/s | 352 kB 00:00 (64/145): libidn2-2.3.7-3.el10.x86_64.rpm 27 MB/s | 118 kB 00:00 (65/145): libmount-2.40.2-10.el10.x86_64.rpm 41 MB/s | 157 kB 00:00 (66/145): libnghttp2-1.64.0-2.el10.x86_64.rpm 20 MB/s | 78 kB 00:00 (67/145): libpkgconf-2.1.0-3.el10.x86_64.rpm 12 MB/s | 39 kB 00:00 (68/145): libpsl-0.21.5-6.el10.x86_64.rpm 21 MB/s | 65 kB 00:00 (69/145): libpwquality-1.4.5-12.el10.x86_64.rpm 27 MB/s | 123 kB 00:00 (70/145): libselinux-3.8-1.el10.x86_64.rpm 21 MB/s | 98 kB 00:00 (71/145): libsemanage-3.8.1-1.el10.x86_64.rpm 28 MB/s | 122 kB 00:00 (72/145): libsepol-3.8-1.el10.x86_64.rpm 73 MB/s | 348 kB 00:00 (73/145): libsmartcols-2.40.2-10.el10.x86_64.rp 18 MB/s | 85 kB 00:00 (74/145): libssh-0.11.1-1.el10.x86_64.rpm 44 MB/s | 234 kB 00:00 (75/145): libssh-config-0.11.1-1.el10.noarch.rp 3.1 MB/s | 9.7 kB 00:00 (76/145): libstdc++-14.2.1-7.el10.x86_64.rpm 121 MB/s | 847 kB 00:00 (77/145): libtasn1-4.20.0-1.el10.x86_64.rpm 13 MB/s | 75 kB 00:00 (78/145): libunistring-1.1-10.el10.x86_64.rpm 79 MB/s | 548 kB 00:00 (79/145): libutempter-1.2.1-15.el10.x86_64.rpm 8.4 MB/s | 27 kB 00:00 (80/145): libuuid-2.40.2-10.el10.x86_64.rpm 10 MB/s | 30 kB 00:00 (81/145): libverto-0.3.2-10.el10.x86_64.rpm 8.5 MB/s | 22 kB 00:00 (82/145): libxcrypt-4.4.36-10.el10.x86_64.rpm 32 MB/s | 120 kB 00:00 (83/145): libxml2-2.12.5-5.el10.x86_64.rpm 128 MB/s | 692 kB 00:00 (84/145): libzstd-1.5.5-9.el10.x86_64.rpm 70 MB/s | 292 kB 00:00 (85/145): lua-libs-5.4.6-7.el10.x86_64.rpm 38 MB/s | 132 kB 00:00 (86/145): lz4-libs-1.9.4-8.el10.x86_64.rpm 20 MB/s | 68 kB 00:00 (87/145): mpfr-4.2.1-5.el10.x86_64.rpm 96 MB/s | 346 kB 00:00 (88/145): ncurses-base-6.4-14.20240127.el10.noa 30 MB/s | 100 kB 00:00 (89/145): ncurses-libs-6.4-14.20240127.el10.x86 85 MB/s | 338 kB 00:00 (90/145): openldap-2.6.8-3.el10.x86_64.rpm 67 MB/s | 240 kB 00:00 (91/145): p11-kit-0.25.5-7.el10.x86_64.rpm 67 MB/s | 497 kB 00:00 (92/145): p11-kit-trust-0.25.5-7.el10.x86_64.rp 18 MB/s | 134 kB 00:00 (93/145): openssl-libs-3.5.0-2.el10.x86_64.rpm 187 MB/s | 2.6 MB 00:00 (94/145): pam-libs-1.6.1-7.el10.x86_64.rpm 16 MB/s | 58 kB 00:00 (95/145): pam-1.6.1-7.el10.x86_64.rpm 78 MB/s | 585 kB 00:00 (96/145): pcre2-10.44-1.el10.3.x86_64.rpm 48 MB/s | 248 kB 00:00 (97/145): pcre2-syntax-10.44-1.el10.3.noarch.rp 29 MB/s | 151 kB 00:00 (98/145): pkgconf-2.1.0-3.el10.x86_64.rpm 13 MB/s | 45 kB 00:00 (99/145): pkgconf-m4-2.1.0-3.el10.noarch.rpm 5.5 MB/s | 15 kB 00:00 (100/145): pkgconf-pkg-config-2.1.0-3.el10.x86_ 4.5 MB/s | 11 kB 00:00 (101/145): popt-1.19-8.el10.x86_64.rpm 18 MB/s | 66 kB 00:00 (102/145): publicsuffix-list-dafsa-20240107-5.e 22 MB/s | 59 kB 00:00 (103/145): readline-8.2-11.el10.x86_64.rpm 52 MB/s | 214 kB 00:00 (104/145): rpm-build-libs-4.19.1.1-14.el10.x86_ 25 MB/s | 95 kB 00:00 (105/145): rpm-4.19.1.1-14.el10.x86_64.rpm 107 MB/s | 566 kB 00:00 (106/145): rpm-libs-4.19.1.1-14.el10.x86_64.rpm 72 MB/s | 311 kB 00:00 (107/145): rpm-sequoia-1.6.0-6.el10.x86_64.rpm 158 MB/s | 874 kB 00:00 (108/145): sed-4.9-3.el10.x86_64.rpm 58 MB/s | 318 kB 00:00 (109/145): setup-2.14.5-4.el10.noarch.rpm 29 MB/s | 154 kB 00:00 (110/145): shadow-utils-4.15.0-5.el10.x86_64.rp 148 MB/s | 1.3 MB 00:00 (111/145): sqlite-libs-3.46.1-3.el10.x86_64.rpm 73 MB/s | 747 kB 00:00 (112/145): systemd-libs-257-11.el10.x86_64.rpm 75 MB/s | 819 kB 00:00 (113/145): tar-1.35-7.el10.x86_64.rpm 140 MB/s | 863 kB 00:00 (114/145): unzip-6.0-66.el10.x86_64.rpm 34 MB/s | 189 kB 00:00 (115/145): which-2.21-43.el10.x86_64.rpm 14 MB/s | 42 kB 00:00 (116/145): util-linux-2.40.2-10.el10.x86_64.rpm 133 MB/s | 1.2 MB 00:00 (117/145): util-linux-core-2.40.2-10.el10.x86_6 74 MB/s | 548 kB 00:00 (118/145): xz-5.6.2-3.el10.x86_64.rpm 79 MB/s | 482 kB 00:00 (119/145): xz-libs-5.6.2-3.el10.x86_64.rpm 25 MB/s | 113 kB 00:00 (120/145): zip-3.0-44.el10.x86_64.rpm 49 MB/s | 270 kB 00:00 (121/145): zlib-ng-compat-2.2.3-2.el10.x86_64.r 19 MB/s | 79 kB 00:00 (122/145): zstd-1.5.5-9.el10.x86_64.rpm 95 MB/s | 464 kB 00:00 (123/145): debugedit-5.1-7.el10.x86_64.rpm 17 MB/s | 80 kB 00:00 (124/145): efi-srpm-macros-6-6.el10.noarch.rpm 10 MB/s | 24 kB 00:00 (125/145): dwz-0.15-7.el10.x86_64.rpm 36 MB/s | 139 kB 00:00 (126/145): fonts-srpm-macros-2.0.5-18.el10.noar 12 MB/s | 27 kB 00:00 (127/145): forge-srpm-macros-0.4.0-6.el10.noarc 7.1 MB/s | 21 kB 00:00 (128/145): go-srpm-macros-3.6.0-4.el10.noarch.r 12 MB/s | 28 kB 00:00 (129/145): kernel-srpm-macros-1.0-25.el10.noarc 4.4 MB/s | 11 kB 00:00 (130/145): lua-srpm-macros-1-15.el10.noarch.rpm 4.5 MB/s | 9.8 kB 00:00 (131/145): ocaml-srpm-macros-10-4.el10.noarch.r 3.3 MB/s | 10 kB 00:00 (132/145): openblas-srpm-macros-2-19.el10.noarc 3.7 MB/s | 8.7 kB 00:00 (133/145): gdb-minimal-14.2-4.el10.x86_64.rpm 271 MB/s | 3.9 MB 00:00 (134/145): package-notes-srpm-macros-0.5-13.el1 1.6 MB/s | 10 kB 00:00 (135/145): patch-2.7.6-26.el10.x86_64.rpm 18 MB/s | 132 kB 00:00 (136/145): perl-srpm-macros-1-57.el10.noarch.rp 3.8 MB/s | 9.5 kB 00:00 (137/145): pyproject-srpm-macros-1.16.2-1.el10. 5.9 MB/s | 15 kB 00:00 (138/145): python-srpm-macros-3.12-10.el10.noar 11 MB/s | 24 kB 00:00 (139/145): qt6-srpm-macros-6.8.1-4.el10.noarch. 4.7 MB/s | 11 kB 00:00 (140/145): redhat-rpm-config-289-1.el10.noarch. 23 MB/s | 78 kB 00:00 (141/145): rpm-build-4.19.1.1-14.el10.x86_64.rp 24 MB/s | 77 kB 00:00 (142/145): rust-toolset-srpm-macros-1.85.0-2.el 5.3 MB/s | 13 kB 00:00 (143/145): epel-rpm-macros-10-4.el10_1.noarch.r 841 kB/s | 7.9 kB 00:00 (144/145): ghc-srpm-macros-1.9.2-1.el10_0.noarc 995 kB/s | 9.1 kB 00:00 (145/145): ansible-srpm-macros-1-16.1.el10_0.no 1.9 MB/s | 20 kB 00:00 -------------------------------------------------------------------------------- Total 145 MB/s | 60 MB 00:00 CentOS Stream 10 - BaseOS 1.6 MB/s | 1.6 kB 00:00 Importing GPG key 0x8483C65D: Userid : "CentOS (CentOS Official Signing Key) " Fingerprint: 99DB 70FA E1D7 CE22 7FB6 4882 05B5 55B3 8483 C65D From : /usr/share/distribution-gpg-keys/centos/RPM-GPG-KEY-CentOS-Official-SHA256 Key imported successfully Extra Packages for Enterprise Linux 10 - x86_64 1.6 MB/s | 1.6 kB 00:00 Importing GPG key 0xE37ED158: Userid : "Fedora (epel10) " Fingerprint: 7D8D 15CB FC4E 6268 8591 FB26 33D9 8517 E37E D158 From : /usr/share/distribution-gpg-keys/epel/RPM-GPG-KEY-EPEL-10 Key imported successfully Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Running scriptlet: filesystem-3.18-16.el10.x86_64 1/1 Preparing : 1/1 Installing : libgcc-14.2.1-7.el10.x86_64 1/145 Running scriptlet: libgcc-14.2.1-7.el10.x86_64 1/145 Installing : ghc-srpm-macros-1.9.2-1.el10_0.noarch 2/145 Installing : ansible-srpm-macros-1-16.1.el10_0.noarch 3/145 Installing : rust-toolset-srpm-macros-1.85.0-2.el10.noarch 4/145 Installing : qt6-srpm-macros-6.8.1-4.el10.noarch 5/145 Installing : perl-srpm-macros-1-57.el10.noarch 6/145 Installing : package-notes-srpm-macros-0.5-13.el10.noarch 7/145 Installing : openblas-srpm-macros-2-19.el10.noarch 8/145 Installing : ocaml-srpm-macros-10-4.el10.noarch 9/145 Installing : kernel-srpm-macros-1.0-25.el10.noarch 10/145 Installing : publicsuffix-list-dafsa-20240107-5.el10.noarch 11/145 Installing : pkgconf-m4-2.1.0-3.el10.noarch 12/145 Installing : pcre2-syntax-10.44-1.el10.3.noarch 13/145 Installing : ncurses-base-6.4-14.20240127.el10.noarch 14/145 Installing : libssh-config-0.11.1-1.el10.noarch 15/145 Installing : coreutils-common-9.5-6.el10.x86_64 16/145 Installing : centos-gpg-keys-10.0-8.el10.noarch 17/145 Installing : centos-stream-repos-10.0-8.el10.noarch 18/145 Installing : centos-stream-release-10.0-8.el10.noarch 19/145 Installing : setup-2.14.5-4.el10.noarch 20/145 warning: /etc/hosts created as /etc/hosts.rpmnew Running scriptlet: setup-2.14.5-4.el10.noarch 20/145 Installing : filesystem-3.18-16.el10.x86_64 21/145 Installing : basesystem-11-22.el10.noarch 22/145 Installing : glibc-gconv-extra-2.39-37.el10.x86_64 23/145 Running scriptlet: glibc-gconv-extra-2.39-37.el10.x86_64 23/145 Installing : glibc-minimal-langpack-2.39-37.el10.x86_64 24/145 Installing : glibc-common-2.39-37.el10.x86_64 25/145 Running scriptlet: glibc-2.39-37.el10.x86_64 26/145 Installing : glibc-2.39-37.el10.x86_64 26/145 Running scriptlet: glibc-2.39-37.el10.x86_64 26/145 Installing : ncurses-libs-6.4-14.20240127.el10.x86_64 27/145 Installing : bash-5.2.26-6.el10.x86_64 28/145 Running scriptlet: bash-5.2.26-6.el10.x86_64 28/145 Installing : zlib-ng-compat-2.2.3-2.el10.x86_64 29/145 Installing : bzip2-libs-1.0.8-25.el10.x86_64 30/145 Installing : xz-libs-1:5.6.2-3.el10.x86_64 31/145 Installing : readline-8.2-11.el10.x86_64 32/145 Installing : libstdc++-14.2.1-7.el10.x86_64 33/145 Installing : libuuid-2.40.2-10.el10.x86_64 34/145 Installing : popt-1.19-8.el10.x86_64 35/145 Installing : libblkid-2.40.2-10.el10.x86_64 36/145 Installing : gmp-1:6.2.1-10.el10.x86_64 37/145 Installing : libattr-2.5.2-5.el10.x86_64 38/145 Installing : libacl-2.3.2-4.el10.x86_64 39/145 Installing : libxcrypt-4.4.36-10.el10.x86_64 40/145 Installing : libzstd-1.5.5-9.el10.x86_64 41/145 Installing : elfutils-libelf-0.193-1.el10.x86_64 42/145 Running scriptlet: crypto-policies-20250424-2.git9267dee.el10.noarc 43/145 Installing : crypto-policies-20250424-2.git9267dee.el10.noarc 43/145 Running scriptlet: crypto-policies-20250424-2.git9267dee.el10.noarc 43/145 Installing : gdbm-libs-1:1.23-11.el10.x86_64 44/145 Installing : libeconf-0.6.2-4.el10.x86_64 45/145 Installing : dwz-0.15-7.el10.x86_64 46/145 Installing : mpfr-4.2.1-5.el10.x86_64 47/145 Installing : gawk-5.3.0-6.el10.x86_64 48/145 Installing : unzip-6.0-66.el10.x86_64 49/145 Installing : file-libs-5.45-7.el10.x86_64 50/145 Installing : file-5.45-7.el10.x86_64 51/145 Installing : alternatives-1.30-2.el10.x86_64 52/145 Installing : jansson-2.14-3.el10.x86_64 53/145 Installing : libcap-ng-0.8.4-6.el10.x86_64 54/145 Installing : audit-libs-4.0.3-4.el10.x86_64 55/145 Installing : pam-libs-1.6.1-7.el10.x86_64 56/145 Installing : libcap-2.69-7.el10.x86_64 57/145 Installing : systemd-libs-257-11.el10.x86_64 58/145 Installing : libsepol-3.8-1.el10.x86_64 59/145 Installing : libsmartcols-2.40.2-10.el10.x86_64 60/145 Installing : libtasn1-4.20.0-1.el10.x86_64 61/145 Installing : libunistring-1.1-10.el10.x86_64 62/145 Installing : libidn2-2.3.7-3.el10.x86_64 63/145 Installing : lua-libs-5.4.6-7.el10.x86_64 64/145 Installing : lz4-libs-1.9.4-8.el10.x86_64 65/145 Installing : pcre2-10.44-1.el10.3.x86_64 66/145 Installing : libselinux-3.8-1.el10.x86_64 67/145 Installing : sed-4.9-3.el10.x86_64 68/145 Installing : findutils-1:4.10.0-5.el10.x86_64 69/145 Installing : grep-3.11-10.el10.x86_64 70/145 Installing : xz-1:5.6.2-3.el10.x86_64 71/145 Installing : libmount-2.40.2-10.el10.x86_64 72/145 Installing : util-linux-core-2.40.2-10.el10.x86_64 73/145 Installing : libsemanage-3.8.1-1.el10.x86_64 74/145 Installing : shadow-utils-2:4.15.0-5.el10.x86_64 75/145 Running scriptlet: libutempter-1.2.1-15.el10.x86_64 76/145 Installing : libutempter-1.2.1-15.el10.x86_64 76/145 Installing : tar-2:1.35-7.el10.x86_64 77/145 Installing : zstd-1.5.5-9.el10.x86_64 78/145 Installing : libpsl-0.21.5-6.el10.x86_64 79/145 Installing : zip-3.0-44.el10.x86_64 80/145 Installing : cyrus-sasl-lib-2.1.28-27.el10.x86_64 81/145 Installing : gdbm-1:1.23-11.el10.x86_64 82/145 Installing : libfdisk-2.40.2-10.el10.x86_64 83/145 Installing : libxml2-2.12.5-5.el10.x86_64 84/145 Installing : bzip2-1.0.8-25.el10.x86_64 85/145 Installing : sqlite-libs-3.46.1-3.el10.x86_64 86/145 Installing : ed-1.20-5.el10.x86_64 87/145 Installing : patch-2.7.6-26.el10.x86_64 88/145 Installing : elfutils-default-yama-scope-0.193-1.el10.noarch 89/145 Running scriptlet: elfutils-default-yama-scope-0.193-1.el10.noarch 89/145 Installing : cpio-2.15-3.el10.x86_64 90/145 Installing : diffutils-3.10-8.el10.x86_64 91/145 Installing : json-c-0.18-3.el10.x86_64 92/145 Installing : keyutils-libs-1.6.3-5.el10.x86_64 93/145 Installing : libbrotli-1.1.0-6.el10.x86_64 94/145 Installing : libcom_err-1.47.1-3.el10.x86_64 95/145 Installing : libffi-3.4.4-9.el10.x86_64 96/145 Installing : p11-kit-0.25.5-7.el10.x86_64 97/145 Installing : p11-kit-trust-0.25.5-7.el10.x86_64 98/145 Running scriptlet: p11-kit-trust-0.25.5-7.el10.x86_64 98/145 Installing : openssl-libs-1:3.5.0-2.el10.x86_64 99/145 Installing : coreutils-9.5-6.el10.x86_64 100/145 Running scriptlet: ca-certificates-2024.2.69_v8.0.303-102.3.el10.no 101/145 Installing : ca-certificates-2024.2.69_v8.0.303-102.3.el10.no 101/145 Running scriptlet: ca-certificates-2024.2.69_v8.0.303-102.3.el10.no 101/145 Installing : authselect-libs-1.5.0-8.el10.x86_64 102/145 Installing : gzip-1.13-3.el10.x86_64 103/145 Installing : cracklib-2.9.11-8.el10.x86_64 104/145 Installing : libarchive-3.7.7-1.el10.x86_64 105/145 Installing : cracklib-dicts-2.9.11-8.el10.x86_64 106/145 Installing : libpwquality-1.4.5-12.el10.x86_64 107/145 Installing : pam-1.6.1-7.el10.x86_64 108/145 Installing : libevent-2.1.12-16.el10.x86_64 109/145 Installing : openldap-2.6.8-3.el10.x86_64 110/145 Installing : rpm-sequoia-1.6.0-6.el10.x86_64 111/145 Installing : rpm-libs-4.19.1.1-14.el10.x86_64 112/145 Installing : libgomp-14.2.1-7.el10.x86_64 113/145 Installing : libnghttp2-1.64.0-2.el10.x86_64 114/145 Installing : libpkgconf-2.1.0-3.el10.x86_64 115/145 Installing : pkgconf-2.1.0-3.el10.x86_64 116/145 Installing : pkgconf-pkg-config-2.1.0-3.el10.x86_64 117/145 Installing : libverto-0.3.2-10.el10.x86_64 118/145 Installing : krb5-libs-1.21.3-7.el10.x86_64 119/145 Installing : libssh-0.11.1-1.el10.x86_64 120/145 Installing : libcurl-8.12.1-2.el10.x86_64 121/145 Installing : elfutils-libs-0.193-1.el10.x86_64 122/145 Installing : elfutils-debuginfod-client-0.193-1.el10.x86_64 123/145 Installing : binutils-gold-2.41-53.el10.x86_64 124/145 Running scriptlet: binutils-gold-2.41-53.el10.x86_64 124/145 Installing : binutils-2.41-53.el10.x86_64 125/145 Running scriptlet: binutils-2.41-53.el10.x86_64 125/145 Installing : elfutils-0.193-1.el10.x86_64 126/145 Installing : gdb-minimal-14.2-4.el10.x86_64 127/145 Installing : debugedit-5.1-7.el10.x86_64 128/145 Installing : rpm-build-libs-4.19.1.1-14.el10.x86_64 129/145 Installing : curl-8.12.1-2.el10.x86_64 130/145 Running scriptlet: rpm-4.19.1.1-14.el10.x86_64 131/145 Installing : rpm-4.19.1.1-14.el10.x86_64 131/145 Installing : efi-srpm-macros-6-6.el10.noarch 132/145 Installing : lua-srpm-macros-1-15.el10.noarch 133/145 Installing : fonts-srpm-macros-1:2.0.5-18.el10.noarch 134/145 Installing : forge-srpm-macros-0.4.0-6.el10.noarch 135/145 Installing : go-srpm-macros-3.6.0-4.el10.noarch 136/145 Installing : python-srpm-macros-3.12-10.el10.noarch 137/145 Installing : redhat-rpm-config-289-1.el10.noarch 138/145 Installing : rpm-build-4.19.1.1-14.el10.x86_64 139/145 Installing : pyproject-srpm-macros-1.16.2-1.el10.noarch 140/145 Installing : util-linux-2.40.2-10.el10.x86_64 141/145 Running scriptlet: util-linux-2.40.2-10.el10.x86_64 141/145 Installing : authselect-1.5.0-8.el10.x86_64 142/145 Installing : which-2.21-43.el10.x86_64 143/145 Installing : info-7.1-6.el10.x86_64 144/145 Installing : epel-rpm-macros-10-4.el10_1.noarch 145/145 Running scriptlet: filesystem-3.18-16.el10.x86_64 145/145 Running scriptlet: ca-certificates-2024.2.69_v8.0.303-102.3.el10.no 145/145 Running scriptlet: authselect-libs-1.5.0-8.el10.x86_64 145/145 Running scriptlet: rpm-4.19.1.1-14.el10.x86_64 145/145 Running scriptlet: epel-rpm-macros-10-4.el10_1.noarch 145/145 Installed: alternatives-1.30-2.el10.x86_64 ansible-srpm-macros-1-16.1.el10_0.noarch audit-libs-4.0.3-4.el10.x86_64 authselect-1.5.0-8.el10.x86_64 authselect-libs-1.5.0-8.el10.x86_64 basesystem-11-22.el10.noarch bash-5.2.26-6.el10.x86_64 binutils-2.41-53.el10.x86_64 binutils-gold-2.41-53.el10.x86_64 bzip2-1.0.8-25.el10.x86_64 bzip2-libs-1.0.8-25.el10.x86_64 ca-certificates-2024.2.69_v8.0.303-102.3.el10.noarch centos-gpg-keys-10.0-8.el10.noarch centos-stream-release-10.0-8.el10.noarch centos-stream-repos-10.0-8.el10.noarch coreutils-9.5-6.el10.x86_64 coreutils-common-9.5-6.el10.x86_64 cpio-2.15-3.el10.x86_64 cracklib-2.9.11-8.el10.x86_64 cracklib-dicts-2.9.11-8.el10.x86_64 crypto-policies-20250424-2.git9267dee.el10.noarch curl-8.12.1-2.el10.x86_64 cyrus-sasl-lib-2.1.28-27.el10.x86_64 debugedit-5.1-7.el10.x86_64 diffutils-3.10-8.el10.x86_64 dwz-0.15-7.el10.x86_64 ed-1.20-5.el10.x86_64 efi-srpm-macros-6-6.el10.noarch elfutils-0.193-1.el10.x86_64 elfutils-debuginfod-client-0.193-1.el10.x86_64 elfutils-default-yama-scope-0.193-1.el10.noarch elfutils-libelf-0.193-1.el10.x86_64 elfutils-libs-0.193-1.el10.x86_64 epel-rpm-macros-10-4.el10_1.noarch file-5.45-7.el10.x86_64 file-libs-5.45-7.el10.x86_64 filesystem-3.18-16.el10.x86_64 findutils-1:4.10.0-5.el10.x86_64 fonts-srpm-macros-1:2.0.5-18.el10.noarch forge-srpm-macros-0.4.0-6.el10.noarch gawk-5.3.0-6.el10.x86_64 gdb-minimal-14.2-4.el10.x86_64 gdbm-1:1.23-11.el10.x86_64 gdbm-libs-1:1.23-11.el10.x86_64 ghc-srpm-macros-1.9.2-1.el10_0.noarch glibc-2.39-37.el10.x86_64 glibc-common-2.39-37.el10.x86_64 glibc-gconv-extra-2.39-37.el10.x86_64 glibc-minimal-langpack-2.39-37.el10.x86_64 gmp-1:6.2.1-10.el10.x86_64 go-srpm-macros-3.6.0-4.el10.noarch grep-3.11-10.el10.x86_64 gzip-1.13-3.el10.x86_64 info-7.1-6.el10.x86_64 jansson-2.14-3.el10.x86_64 json-c-0.18-3.el10.x86_64 kernel-srpm-macros-1.0-25.el10.noarch keyutils-libs-1.6.3-5.el10.x86_64 krb5-libs-1.21.3-7.el10.x86_64 libacl-2.3.2-4.el10.x86_64 libarchive-3.7.7-1.el10.x86_64 libattr-2.5.2-5.el10.x86_64 libblkid-2.40.2-10.el10.x86_64 libbrotli-1.1.0-6.el10.x86_64 libcap-2.69-7.el10.x86_64 libcap-ng-0.8.4-6.el10.x86_64 libcom_err-1.47.1-3.el10.x86_64 libcurl-8.12.1-2.el10.x86_64 libeconf-0.6.2-4.el10.x86_64 libevent-2.1.12-16.el10.x86_64 libfdisk-2.40.2-10.el10.x86_64 libffi-3.4.4-9.el10.x86_64 libgcc-14.2.1-7.el10.x86_64 libgomp-14.2.1-7.el10.x86_64 libidn2-2.3.7-3.el10.x86_64 libmount-2.40.2-10.el10.x86_64 libnghttp2-1.64.0-2.el10.x86_64 libpkgconf-2.1.0-3.el10.x86_64 libpsl-0.21.5-6.el10.x86_64 libpwquality-1.4.5-12.el10.x86_64 libselinux-3.8-1.el10.x86_64 libsemanage-3.8.1-1.el10.x86_64 libsepol-3.8-1.el10.x86_64 libsmartcols-2.40.2-10.el10.x86_64 libssh-0.11.1-1.el10.x86_64 libssh-config-0.11.1-1.el10.noarch libstdc++-14.2.1-7.el10.x86_64 libtasn1-4.20.0-1.el10.x86_64 libunistring-1.1-10.el10.x86_64 libutempter-1.2.1-15.el10.x86_64 libuuid-2.40.2-10.el10.x86_64 libverto-0.3.2-10.el10.x86_64 libxcrypt-4.4.36-10.el10.x86_64 libxml2-2.12.5-5.el10.x86_64 libzstd-1.5.5-9.el10.x86_64 lua-libs-5.4.6-7.el10.x86_64 lua-srpm-macros-1-15.el10.noarch lz4-libs-1.9.4-8.el10.x86_64 mpfr-4.2.1-5.el10.x86_64 ncurses-base-6.4-14.20240127.el10.noarch ncurses-libs-6.4-14.20240127.el10.x86_64 ocaml-srpm-macros-10-4.el10.noarch openblas-srpm-macros-2-19.el10.noarch openldap-2.6.8-3.el10.x86_64 openssl-libs-1:3.5.0-2.el10.x86_64 p11-kit-0.25.5-7.el10.x86_64 p11-kit-trust-0.25.5-7.el10.x86_64 package-notes-srpm-macros-0.5-13.el10.noarch pam-1.6.1-7.el10.x86_64 pam-libs-1.6.1-7.el10.x86_64 patch-2.7.6-26.el10.x86_64 pcre2-10.44-1.el10.3.x86_64 pcre2-syntax-10.44-1.el10.3.noarch perl-srpm-macros-1-57.el10.noarch pkgconf-2.1.0-3.el10.x86_64 pkgconf-m4-2.1.0-3.el10.noarch pkgconf-pkg-config-2.1.0-3.el10.x86_64 popt-1.19-8.el10.x86_64 publicsuffix-list-dafsa-20240107-5.el10.noarch pyproject-srpm-macros-1.16.2-1.el10.noarch python-srpm-macros-3.12-10.el10.noarch qt6-srpm-macros-6.8.1-4.el10.noarch readline-8.2-11.el10.x86_64 redhat-rpm-config-289-1.el10.noarch rpm-4.19.1.1-14.el10.x86_64 rpm-build-4.19.1.1-14.el10.x86_64 rpm-build-libs-4.19.1.1-14.el10.x86_64 rpm-libs-4.19.1.1-14.el10.x86_64 rpm-sequoia-1.6.0-6.el10.x86_64 rust-toolset-srpm-macros-1.85.0-2.el10.noarch sed-4.9-3.el10.x86_64 setup-2.14.5-4.el10.noarch shadow-utils-2:4.15.0-5.el10.x86_64 sqlite-libs-3.46.1-3.el10.x86_64 systemd-libs-257-11.el10.x86_64 tar-2:1.35-7.el10.x86_64 unzip-6.0-66.el10.x86_64 util-linux-2.40.2-10.el10.x86_64 util-linux-core-2.40.2-10.el10.x86_64 which-2.21-43.el10.x86_64 xz-1:5.6.2-3.el10.x86_64 xz-libs-1:5.6.2-3.el10.x86_64 zip-3.0-44.el10.x86_64 zlib-ng-compat-2.2.3-2.el10.x86_64 zstd-1.5.5-9.el10.x86_64 Complete! Finish: installing minimal buildroot with dnf Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: alternatives-1.30-2.el10.x86_64 ansible-srpm-macros-1-16.1.el10_0.noarch audit-libs-4.0.3-4.el10.x86_64 authselect-1.5.0-8.el10.x86_64 authselect-libs-1.5.0-8.el10.x86_64 basesystem-11-22.el10.noarch bash-5.2.26-6.el10.x86_64 binutils-2.41-53.el10.x86_64 binutils-gold-2.41-53.el10.x86_64 bzip2-1.0.8-25.el10.x86_64 bzip2-libs-1.0.8-25.el10.x86_64 ca-certificates-2024.2.69_v8.0.303-102.3.el10.noarch centos-gpg-keys-10.0-8.el10.noarch centos-stream-release-10.0-8.el10.noarch centos-stream-repos-10.0-8.el10.noarch coreutils-9.5-6.el10.x86_64 coreutils-common-9.5-6.el10.x86_64 cpio-2.15-3.el10.x86_64 cracklib-2.9.11-8.el10.x86_64 cracklib-dicts-2.9.11-8.el10.x86_64 crypto-policies-20250424-2.git9267dee.el10.noarch curl-8.12.1-2.el10.x86_64 cyrus-sasl-lib-2.1.28-27.el10.x86_64 debugedit-5.1-7.el10.x86_64 diffutils-3.10-8.el10.x86_64 dwz-0.15-7.el10.x86_64 ed-1.20-5.el10.x86_64 efi-srpm-macros-6-6.el10.noarch elfutils-0.193-1.el10.x86_64 elfutils-debuginfod-client-0.193-1.el10.x86_64 elfutils-default-yama-scope-0.193-1.el10.noarch elfutils-libelf-0.193-1.el10.x86_64 elfutils-libs-0.193-1.el10.x86_64 epel-rpm-macros-10-4.el10_1.noarch file-5.45-7.el10.x86_64 file-libs-5.45-7.el10.x86_64 filesystem-3.18-16.el10.x86_64 findutils-4.10.0-5.el10.x86_64 fonts-srpm-macros-2.0.5-18.el10.noarch forge-srpm-macros-0.4.0-6.el10.noarch gawk-5.3.0-6.el10.x86_64 gdb-minimal-14.2-4.el10.x86_64 gdbm-1.23-11.el10.x86_64 gdbm-libs-1.23-11.el10.x86_64 ghc-srpm-macros-1.9.2-1.el10_0.noarch glibc-2.39-37.el10.x86_64 glibc-common-2.39-37.el10.x86_64 glibc-gconv-extra-2.39-37.el10.x86_64 glibc-minimal-langpack-2.39-37.el10.x86_64 gmp-6.2.1-10.el10.x86_64 go-srpm-macros-3.6.0-4.el10.noarch gpg-pubkey-8483c65d-5ccc5b19 gpg-pubkey-e37ed158-65785fa9 grep-3.11-10.el10.x86_64 gzip-1.13-3.el10.x86_64 info-7.1-6.el10.x86_64 jansson-2.14-3.el10.x86_64 json-c-0.18-3.el10.x86_64 kernel-srpm-macros-1.0-25.el10.noarch keyutils-libs-1.6.3-5.el10.x86_64 krb5-libs-1.21.3-7.el10.x86_64 libacl-2.3.2-4.el10.x86_64 libarchive-3.7.7-1.el10.x86_64 libattr-2.5.2-5.el10.x86_64 libblkid-2.40.2-10.el10.x86_64 libbrotli-1.1.0-6.el10.x86_64 libcap-2.69-7.el10.x86_64 libcap-ng-0.8.4-6.el10.x86_64 libcom_err-1.47.1-3.el10.x86_64 libcurl-8.12.1-2.el10.x86_64 libeconf-0.6.2-4.el10.x86_64 libevent-2.1.12-16.el10.x86_64 libfdisk-2.40.2-10.el10.x86_64 libffi-3.4.4-9.el10.x86_64 libgcc-14.2.1-7.el10.x86_64 libgomp-14.2.1-7.el10.x86_64 libidn2-2.3.7-3.el10.x86_64 libmount-2.40.2-10.el10.x86_64 libnghttp2-1.64.0-2.el10.x86_64 libpkgconf-2.1.0-3.el10.x86_64 libpsl-0.21.5-6.el10.x86_64 libpwquality-1.4.5-12.el10.x86_64 libselinux-3.8-1.el10.x86_64 libsemanage-3.8.1-1.el10.x86_64 libsepol-3.8-1.el10.x86_64 libsmartcols-2.40.2-10.el10.x86_64 libssh-0.11.1-1.el10.x86_64 libssh-config-0.11.1-1.el10.noarch libstdc++-14.2.1-7.el10.x86_64 libtasn1-4.20.0-1.el10.x86_64 libunistring-1.1-10.el10.x86_64 libutempter-1.2.1-15.el10.x86_64 libuuid-2.40.2-10.el10.x86_64 libverto-0.3.2-10.el10.x86_64 libxcrypt-4.4.36-10.el10.x86_64 libxml2-2.12.5-5.el10.x86_64 libzstd-1.5.5-9.el10.x86_64 lua-libs-5.4.6-7.el10.x86_64 lua-srpm-macros-1-15.el10.noarch lz4-libs-1.9.4-8.el10.x86_64 mpfr-4.2.1-5.el10.x86_64 ncurses-base-6.4-14.20240127.el10.noarch ncurses-libs-6.4-14.20240127.el10.x86_64 ocaml-srpm-macros-10-4.el10.noarch openblas-srpm-macros-2-19.el10.noarch openldap-2.6.8-3.el10.x86_64 openssl-libs-3.5.0-2.el10.x86_64 p11-kit-0.25.5-7.el10.x86_64 p11-kit-trust-0.25.5-7.el10.x86_64 package-notes-srpm-macros-0.5-13.el10.noarch pam-1.6.1-7.el10.x86_64 pam-libs-1.6.1-7.el10.x86_64 patch-2.7.6-26.el10.x86_64 pcre2-10.44-1.el10.3.x86_64 pcre2-syntax-10.44-1.el10.3.noarch perl-srpm-macros-1-57.el10.noarch pkgconf-2.1.0-3.el10.x86_64 pkgconf-m4-2.1.0-3.el10.noarch pkgconf-pkg-config-2.1.0-3.el10.x86_64 popt-1.19-8.el10.x86_64 publicsuffix-list-dafsa-20240107-5.el10.noarch pyproject-srpm-macros-1.16.2-1.el10.noarch python-srpm-macros-3.12-10.el10.noarch qt6-srpm-macros-6.8.1-4.el10.noarch readline-8.2-11.el10.x86_64 redhat-rpm-config-289-1.el10.noarch rpm-4.19.1.1-14.el10.x86_64 rpm-build-4.19.1.1-14.el10.x86_64 rpm-build-libs-4.19.1.1-14.el10.x86_64 rpm-libs-4.19.1.1-14.el10.x86_64 rpm-sequoia-1.6.0-6.el10.x86_64 rust-toolset-srpm-macros-1.85.0-2.el10.noarch sed-4.9-3.el10.x86_64 setup-2.14.5-4.el10.noarch shadow-utils-4.15.0-5.el10.x86_64 sqlite-libs-3.46.1-3.el10.x86_64 systemd-libs-257-11.el10.x86_64 tar-1.35-7.el10.x86_64 unzip-6.0-66.el10.x86_64 util-linux-2.40.2-10.el10.x86_64 util-linux-core-2.40.2-10.el10.x86_64 which-2.21-43.el10.x86_64 xz-5.6.2-3.el10.x86_64 xz-libs-5.6.2-3.el10.x86_64 zip-3.0-44.el10.x86_64 zlib-ng-compat-2.2.3-2.el10.x86_64 zstd-1.5.5-9.el10.x86_64 Start: buildsrpm Start: rpmbuild -bs warning: bogus date in %changelog: Tue Apr 23 2025 Tom Rix - 6.4.0-2 Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1745366400 Wrote: /builddir/build/SRPMS/rccl-6.4.0-2.el10.src.rpm RPM build warnings: bogus date in %changelog: Tue Apr 23 2025 Tom Rix - 6.4.0-2 Finish: rpmbuild -bs INFO: chroot_scan: 3 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/centos-stream+epel-10-x86_64-1746813413.875217/root/var/log/dnf.log /var/lib/mock/centos-stream+epel-10-x86_64-1746813413.875217/root/var/log/dnf.librepo.log /var/lib/mock/centos-stream+epel-10-x86_64-1746813413.875217/root/var/log/dnf.rpm.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-tijfnks1/rccl/rccl.spec) Config(child) 0 minutes 23 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/rccl-6.4.0-2.el10.src.rpm) Config(centos-stream+epel-10-x86_64) Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/centos-stream+epel-10-x86_64-bootstrap-1746813413.875217/root. INFO: reusing tmpfs at /var/lib/mock/centos-stream+epel-10-x86_64-bootstrap-1746813413.875217/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/centos-stream+epel-10-x86_64-1746813413.875217/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-4.19.1.1-14.el10.x86_64 rpm-sequoia-1.6.0-6.el10.x86_64 python3-dnf-4.20.0-13.el10.noarch python3-dnf-plugins-core-4.7.0-9.el10.noarch yum-4.20.0-13.el10.noarch Finish: chroot init Start: build phase for rccl-6.4.0-2.el10.src.rpm Start: build setup for rccl-6.4.0-2.el10.src.rpm warning: bogus date in %changelog: Tue Apr 23 2025 Tom Rix - 6.4.0-2 Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1745366400 Wrote: /builddir/build/SRPMS/rccl-6.4.0-2.el10.src.rpm RPM build warnings: bogus date in %changelog: Tue Apr 23 2025 Tom Rix - 6.4.0-2 No matches found for the following disable plugin patterns: local, spacewalk, versionlock Copr repository 86 kB/s | 1.5 kB 00:00 Additional repo https_download_copr_fedorainfra 109 kB/s | 1.8 kB 00:00 CentOS Stream 10 - BaseOS 18 kB/s | 5.3 kB 00:00 CentOS Stream 10 - AppStream 53 kB/s | 5.4 kB 00:00 CentOS Stream 10 - CRB 52 kB/s | 5.2 kB 00:00 CentOS Stream 10 - Extras packages 44 kB/s | 5.7 kB 00:00 Extra Packages for Enterprise Linux 10 - x86_64 954 kB/s | 33 kB 00:00 Dependencies resolved. ======================================================================================================================================================================================= Package Arch Version Repository Size ======================================================================================================================================================================================= Installing: cmake x86_64 3.30.5-3.el10 appstream 12 M gcc-c++ x86_64 14.2.1-7.el10 appstream 15 M hipify x86_64 6.4.0-1.el10 copr_base 505 k rocm-cmake noarch 6.4.0-1.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 38 k rocm-comgr-devel x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 32 k rocm-core-devel x86_64 6.4.0-1.el10 copr_base 13 k rocm-hip-devel x86_64 6.4.0-3.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 233 k rocm-rpm-macros noarch 6.4.0-4.limitedisa.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 16 k rocm-runtime-devel x86_64 6.4.0-1.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 94 k rocm-smi-devel x86_64 6.4.0-2.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 57 k Installing dependencies: annobin-docs noarch 12.93-1.el10 appstream 92 k annobin-plugin-gcc x86_64 12.93-1.el10 appstream 987 k cmake-data noarch 3.30.5-3.el10 appstream 2.5 M cmake-filesystem x86_64 3.30.5-3.el10 appstream 23 k cmake-rpm-macros noarch 3.30.5-3.el10 appstream 16 k cpp x86_64 14.2.1-7.el10 appstream 12 M emacs-filesystem noarch 1:29.4-10.el10 appstream 10 k environment-modules x86_64 5.3.1-8.el10 baseos 707 k expat x86_64 2.6.4-1.el10 baseos 115 k gcc x86_64 14.2.1-7.el10 appstream 37 M gcc-plugin-annobin x86_64 14.2.1-7.el10 appstream 60 k git x86_64 2.47.1-1.el10 appstream 52 k git-core x86_64 2.47.1-1.el10 appstream 4.7 M git-core-doc noarch 2.47.1-1.el10 appstream 3.1 M glibc-devel x86_64 2.39-37.el10 appstream 637 k gnupg2 x86_64 2.4.5-2.el10 baseos 2.7 M gnutls x86_64 3.8.9-14.el10 baseos 1.4 M groff-base x86_64 1.23.0-10.el10 baseos 1.1 M hipcc x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 134 k hwdata noarch 0.379-10.4.el10 baseos 1.6 M kernel-headers x86_64 6.12.0-80.el10 appstream 2.2 M less x86_64 661-3.el10 baseos 191 k libassuan x86_64 2.5.6-6.el10 baseos 69 k libcbor x86_64 0.11.0-3.el10 baseos 34 k libdrm x86_64 2.4.123-1.el10 appstream 163 k libdrm-devel x86_64 2.4.123-1.el10 appstream 181 k libedit x86_64 3.1-52.20230828cvs.el10 baseos 106 k libfido2 x86_64 1.14.0-7.el10 baseos 98 k libgcrypt x86_64 1.11.0-6.el10 baseos 587 k libgpg-error x86_64 1.50-2.el10 baseos 236 k libksba x86_64 1.6.7-2.el10 baseos 159 k libmpc x86_64 1.3.1-7.el10 appstream 71 k libpciaccess x86_64 0.16-16.el10 baseos 28 k libpciaccess-devel x86_64 0.16-16.el10 appstream 14 k libpipeline x86_64 1.5.7-7.el10 baseos 53 k libstdc++-devel x86_64 14.2.1-7.el10 appstream 2.8 M libuv x86_64 1:1.49.2-2.el10 appstream 260 k libxcrypt-devel x86_64 4.4.36-10.el10 appstream 30 k make x86_64 1:4.4.1-9.el10 baseos 587 k man-db x86_64 2.12.0-8.el10 baseos 1.3 M mpdecimal x86_64 2.5.1-12.el10 baseos 90 k ncurses x86_64 6.4-14.20240127.el10 baseos 423 k npth x86_64 1.6-21.el10 baseos 26 k numactl-libs x86_64 2.0.19-1.el10 baseos 32 k openssh x86_64 9.9p1-9.el10 baseos 353 k openssh-clients x86_64 9.9p1-9.el10 baseos 763 k perl x86_64 4:5.40.2-513.el10 appstream 11 k perl-Algorithm-Diff noarch 1.2010-14.el10 appstream 47 k perl-Archive-Tar noarch 3.02-512.el10 appstream 72 k perl-Archive-Zip noarch 1.68-17.el10 appstream 111 k perl-Attribute-Handlers noarch 1.03-513.el10 appstream 29 k perl-AutoLoader noarch 5.74-513.el10 appstream 22 k perl-AutoSplit noarch 5.74-513.el10 appstream 23 k perl-B x86_64 1.89-513.el10 appstream 178 k perl-Benchmark noarch 1.25-513.el10 appstream 28 k perl-CPAN noarch 2.36-512.el10 appstream 567 k perl-CPAN-Meta noarch 2.150010-511.el10 appstream 198 k perl-CPAN-Meta-Requirements noarch 2.143-11.el10 appstream 36 k perl-CPAN-Meta-YAML noarch 0.018-512.el10 appstream 27 k perl-Carp noarch 1.54-511.el10 baseos 30 k perl-Class-Struct noarch 0.68-513.el10 appstream 23 k perl-Compress-Bzip2 x86_64 2.28-21.el10 appstream 68 k perl-Compress-Raw-Bzip2 x86_64 2.212-512.el10 appstream 36 k perl-Compress-Raw-Lzma x86_64 2.212-3.el10 appstream 52 k perl-Compress-Raw-Zlib x86_64 2.212-512.el10 appstream 66 k perl-Config-Extensions noarch 0.03-513.el10 appstream 13 k perl-Config-Perl-V noarch 0.36-512.el10 appstream 23 k perl-DBM_Filter noarch 0.06-513.el10 appstream 28 k perl-Data-Dumper x86_64 2.189-512.el10 baseos 57 k perl-Data-OptList noarch 0.114-7.el10 appstream 28 k perl-Data-Section noarch 0.200008-8.el10 appstream 26 k perl-Devel-PPPort x86_64 3.72-512.el10 appstream 220 k perl-Devel-Peek x86_64 1.34-513.el10 appstream 33 k perl-Devel-SelfStubber noarch 1.06-513.el10 appstream 15 k perl-Devel-Size x86_64 0.84-3.el10 appstream 32 k perl-Digest noarch 1.20-511.el10 baseos 26 k perl-Digest-MD5 x86_64 2.59-6.el10 baseos 37 k perl-Digest-SHA x86_64 1:6.04-512.el10 appstream 63 k perl-DirHandle noarch 1.05-513.el10 appstream 13 k perl-Dumpvalue noarch 2.27-513.el10 appstream 19 k perl-DynaLoader x86_64 1.56-513.el10 appstream 27 k perl-Encode x86_64 4:3.21-511.el10 baseos 1.1 M perl-Encode-devel x86_64 4:3.21-511.el10 appstream 42 k perl-English noarch 1.11-513.el10 appstream 15 k perl-Env noarch 1.06-511.el10 appstream 21 k perl-Errno x86_64 1.38-513.el10 appstream 16 k perl-Error noarch 1:0.17029-18.el10 appstream 42 k perl-Exporter noarch 5.78-511.el10 baseos 32 k perl-ExtUtils-CBuilder noarch 1:0.280240-511.el10 appstream 50 k perl-ExtUtils-Command noarch 2:7.70-513.el10 appstream 15 k perl-ExtUtils-Constant noarch 0.25-513.el10 appstream 45 k perl-ExtUtils-Embed noarch 1.35-513.el10 appstream 19 k perl-ExtUtils-Install noarch 2.22-511.el10 appstream 44 k perl-ExtUtils-MM-Utils noarch 2:7.70-513.el10 appstream 13 k perl-ExtUtils-MakeMaker noarch 2:7.70-513.el10 appstream 293 k perl-ExtUtils-Manifest noarch 1:1.75-511.el10 appstream 35 k perl-ExtUtils-Miniperl noarch 1.14-513.el10 appstream 16 k perl-ExtUtils-ParseXS noarch 1:3.51-512.el10 appstream 186 k perl-Fcntl x86_64 1.18-513.el10 appstream 31 k perl-File-Basename noarch 2.86-513.el10 appstream 18 k perl-File-Compare noarch 1.100.800-513.el10 appstream 14 k perl-File-Copy noarch 2.41-513.el10 appstream 21 k perl-File-DosGlob x86_64 1.12-513.el10 appstream 21 k perl-File-Fetch noarch 1.04-511.el10 appstream 31 k perl-File-Find noarch 1.44-513.el10 appstream 26 k perl-File-HomeDir noarch 1.006-15.el10 appstream 58 k perl-File-Path noarch 2.18-511.el10 baseos 36 k perl-File-Temp noarch 1:0.231.100-512.el10 baseos 60 k perl-File-Which noarch 1.27-14.el10 appstream 23 k perl-File-stat noarch 1.14-513.el10 appstream 18 k perl-FileCache noarch 1.10-513.el10 appstream 16 k perl-FileHandle noarch 2.05-513.el10 appstream 16 k perl-Filter x86_64 2:1.64-512.el10 appstream 89 k perl-Filter-Simple noarch 0.96-511.el10 appstream 28 k perl-FindBin noarch 1.54-513.el10 appstream 15 k perl-GDBM_File x86_64 1:1.24-513.el10 appstream 43 k perl-Getopt-Long noarch 1:2.58-3.el10 baseos 65 k perl-Getopt-Std noarch 1.14-513.el10 appstream 17 k perl-Git noarch 2.47.1-1.el10 appstream 39 k perl-HTTP-Tiny noarch 0.088-512.el10 baseos 57 k perl-Hash-Util x86_64 0.32-513.el10 appstream 35 k perl-Hash-Util-FieldHash x86_64 1.27-513.el10 appstream 40 k perl-I18N-Collate noarch 1.02-513.el10 appstream 15 k perl-I18N-LangTags noarch 0.45-513.el10 appstream 53 k perl-I18N-Langinfo x86_64 0.24-513.el10 appstream 27 k perl-IO x86_64 1.55-513.el10 appstream 81 k perl-IO-Compress noarch 2.212-512.el10 appstream 306 k perl-IO-Compress-Lzma noarch 2.206-7.el10 appstream 78 k perl-IO-Socket-IP noarch 0.42-512.el10 baseos 43 k perl-IO-Socket-SSL noarch 2.085-3.el10 baseos 227 k perl-IO-Zlib noarch 1:1.15-511.el10 appstream 21 k perl-IPC-Cmd noarch 2:1.04-512.el10 appstream 41 k perl-IPC-Open3 noarch 1.22-513.el10 appstream 23 k perl-IPC-SysV x86_64 2.09-512.el10 appstream 42 k perl-IPC-System-Simple noarch 1.30-16.el10 appstream 40 k perl-JSON-PP noarch 1:4.16-512.el10 appstream 67 k perl-Locale-Maketext noarch 1.33-512.el10 appstream 94 k perl-Locale-Maketext-Simple noarch 1:0.21-513.el10 appstream 19 k perl-MIME-Base64 x86_64 3.16-511.el10 baseos 31 k perl-MRO-Compat noarch 0.15-12.el10 appstream 26 k perl-Math-BigInt noarch 1:2.0030.03-3.el10 appstream 228 k perl-Math-BigInt-FastCalc x86_64 0.501.800-511.el10 appstream 29 k perl-Math-Complex noarch 1.62-513.el10 appstream 47 k perl-Memoize noarch 1.16-513.el10 appstream 47 k perl-Module-Build noarch 2:0.42.34-7.el10 appstream 251 k perl-Module-CoreList noarch 1:5.20250120-1.el10 appstream 92 k perl-Module-CoreList-tools noarch 1:5.20250120-1.el10 appstream 19 k perl-Module-Load noarch 1:0.36-511.el10 appstream 18 k perl-Module-Load-Conditional noarch 0.74-511.el10 appstream 23 k perl-Module-Loaded noarch 1:0.08-513.el10 appstream 14 k perl-Module-Metadata noarch 1.000038-512.el10 appstream 36 k perl-Module-Signature noarch 0.88-12.el10 appstream 83 k perl-Mozilla-CA noarch 20231213-5.el10 baseos 15 k perl-NDBM_File x86_64 1.17-513.el10 appstream 24 k perl-NEXT noarch 0.69-513.el10 appstream 22 k perl-Net noarch 1.04-513.el10 appstream 24 k perl-Net-Ping noarch 2.76-511.el10 appstream 51 k perl-Net-SSLeay x86_64 1.94-7.el10 baseos 381 k perl-ODBM_File x86_64 1.18-513.el10 appstream 24 k perl-Opcode x86_64 1.65-513.el10 appstream 37 k perl-POSIX x86_64 2.20-513.el10 appstream 98 k perl-Package-Generator noarch 1.106-34.el10 appstream 23 k perl-Params-Check noarch 1:0.38-511.el10 appstream 23 k perl-Params-Util x86_64 1.102-19.el10 appstream 34 k perl-PathTools x86_64 3.91-512.el10 baseos 85 k perl-Perl-OSType noarch 1.010-512.el10 appstream 24 k perl-PerlIO-via-QuotedPrint noarch 0.10-511.el10 appstream 23 k perl-Pod-Checker noarch 4:1.77-511.el10 appstream 33 k perl-Pod-Escapes noarch 1:1.07-511.el10 baseos 21 k perl-Pod-Functions noarch 1.14-513.el10 appstream 16 k perl-Pod-Html noarch 1.35-513.el10 appstream 30 k perl-Pod-Perldoc noarch 3.28.01-512.el10 baseos 84 k perl-Pod-Simple noarch 1:3.45-511.el10 baseos 218 k perl-Pod-Usage noarch 4:2.03-511.el10 baseos 41 k perl-Safe noarch 2.46-513.el10 appstream 26 k perl-Scalar-List-Utils x86_64 5:1.63-511.el10 baseos 74 k perl-Search-Dict noarch 1.07-513.el10 appstream 14 k perl-SelectSaver noarch 1.02-513.el10 appstream 13 k perl-SelfLoader noarch 1.27-513.el10 appstream 23 k perl-Socket x86_64 4:2.038-511.el10 baseos 56 k perl-Software-License noarch 0.104006-4.el10 appstream 151 k perl-Storable x86_64 1:3.32-511.el10 baseos 99 k perl-Sub-Exporter noarch 0.991-7.el10 appstream 78 k perl-Sub-Install noarch 0.929-8.el10 appstream 24 k perl-Symbol noarch 1.09-513.el10 appstream 15 k perl-Sys-Hostname x86_64 1.25-513.el10 appstream 18 k perl-Sys-Syslog x86_64 0.36-512.el10 appstream 48 k perl-Term-ANSIColor noarch 5.01-512.el10 baseos 49 k perl-Term-Cap noarch 1.18-511.el10 baseos 23 k perl-Term-Complete noarch 1.403-513.el10 appstream 14 k perl-Term-ReadLine noarch 1.17-513.el10 appstream 20 k perl-Term-Table noarch 0.018-513.el10 appstream 44 k perl-TermReadKey x86_64 2.38-24.el10 appstream 37 k perl-Test noarch 1.31-513.el10 appstream 30 k perl-Test-Harness noarch 1:3.48-512.el10 appstream 284 k perl-Test-Simple noarch 3:1.302199-513.el10 appstream 611 k perl-Test2-Suite noarch 0.000163-4.el10 appstream 485 k perl-Text-Abbrev noarch 1.02-513.el10 appstream 13 k perl-Text-Balanced noarch 2.06-511.el10 appstream 50 k perl-Text-Diff noarch 1.45-24.el10 appstream 41 k perl-Text-Glob noarch 0.11-26.el10 appstream 14 k perl-Text-ParseWords noarch 3.31-511.el10 baseos 17 k perl-Text-Tabs+Wrap noarch 2024.001-511.el10 baseos 23 k perl-Text-Template noarch 1.61-8.el10 appstream 60 k perl-Thread noarch 3.05-513.el10 appstream 19 k perl-Thread-Queue noarch 3.14-511.el10 appstream 22 k perl-Thread-Semaphore noarch 2.13-513.el10 appstream 17 k perl-Tie noarch 4.6-513.el10 appstream 29 k perl-Tie-File noarch 1.09-513.el10 appstream 44 k perl-Tie-Memoize noarch 1.1-513.el10 appstream 15 k perl-Tie-RefHash noarch 1.40-511.el10 appstream 24 k perl-Time noarch 1.04-513.el10 appstream 18 k perl-Time-HiRes x86_64 4:1.9777-511.el10 appstream 59 k perl-Time-Local noarch 2:1.350-511.el10 baseos 35 k perl-Time-Piece x86_64 1.3401-513.el10 appstream 41 k perl-URI noarch 5.27-3.el10 baseos 134 k perl-Unicode-Collate x86_64 1.31-511.el10 appstream 658 k perl-Unicode-Normalize x86_64 1.32-511.el10 appstream 75 k perl-Unicode-UCD noarch 0.78-513.el10 appstream 79 k perl-User-pwent noarch 1.05-513.el10 appstream 20 k perl-autodie noarch 2.37-512.el10 appstream 95 k perl-autouse noarch 1.11-513.el10 appstream 15 k perl-base noarch 2.27-513.el10 appstream 17 k perl-bignum noarch 0.67-512.el10 appstream 51 k perl-blib noarch 1.07-513.el10 appstream 13 k perl-constant noarch 1.33-512.el10 baseos 24 k perl-debugger noarch 1.60-513.el10 appstream 134 k perl-deprecate noarch 0.04-513.el10 appstream 16 k perl-devel x86_64 4:5.40.2-513.el10 appstream 768 k perl-diagnostics noarch 1.40-513.el10 appstream 218 k perl-doc noarch 5.40.2-513.el10 appstream 4.9 M perl-encoding x86_64 4:3.00-511.el10 appstream 64 k perl-encoding-warnings noarch 0.14-513.el10 appstream 18 k perl-experimental noarch 0.032-511.el10 appstream 27 k perl-fields noarch 2.27-513.el10 appstream 17 k perl-filetest noarch 1.03-513.el10 appstream 16 k perl-if noarch 0.61.000-513.el10 appstream 15 k perl-inc-latest noarch 2:0.500-30.el10 appstream 24 k perl-interpreter x86_64 4:5.40.2-513.el10 appstream 73 k perl-less noarch 0.03-513.el10 appstream 14 k perl-lib x86_64 0.65-513.el10 appstream 16 k perl-libnet noarch 3.15-512.el10 baseos 127 k perl-libnetcfg noarch 4:5.40.2-513.el10 appstream 17 k perl-libs x86_64 4:5.40.2-513.el10 appstream 2.4 M perl-local-lib noarch 2.000029-9.el10 appstream 67 k perl-locale noarch 1.12-513.el10 appstream 15 k perl-macros noarch 4:5.40.2-513.el10 appstream 13 k perl-meta-notation noarch 5.40.2-513.el10 appstream 12 k perl-mro x86_64 1.29-513.el10 appstream 31 k perl-open noarch 1.13-513.el10 appstream 17 k perl-overload noarch 1.37-513.el10 appstream 46 k perl-overloading noarch 0.02-513.el10 appstream 14 k perl-parent noarch 1:0.241-512.el10 baseos 16 k perl-perlfaq noarch 5.20240218-511.el10 appstream 376 k perl-ph x86_64 5.40.2-513.el10 appstream 47 k perl-podlators noarch 1:5.01-511.el10 baseos 124 k perl-sigtrap noarch 1.10-513.el10 appstream 17 k perl-sort noarch 2.05-513.el10 appstream 14 k perl-subs noarch 1.04-513.el10 appstream 13 k perl-threads x86_64 1:2.40-511.el10 appstream 59 k perl-threads-shared x86_64 1.69-511.el10 appstream 46 k perl-utils noarch 5.40.2-513.el10 appstream 53 k perl-vars noarch 1.05-513.el10 appstream 14 k perl-version x86_64 8:0.99.32-4.el10 appstream 64 k perl-vmsish noarch 1.04-513.el10 appstream 15 k procps-ng x86_64 4.0.4-7.el10 baseos 349 k python3 x86_64 3.12.10-2.el10 baseos 30 k python3-libs x86_64 3.12.10-2.el10 baseos 9.4 M python3-pip-wheel noarch 23.3.2-7.el10 baseos 1.5 M python3-pyparsing noarch 3.1.1-7.el10 baseos 269 k rocm-clang x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 16 M rocm-clang-devel x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 2.4 M rocm-clang-libs x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 23 M rocm-clang-runtime-devel x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 493 k rocm-comgr x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 31 M rocm-core x86_64 6.4.0-1.el10 copr_base 13 k rocm-device-libs x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 490 k rocm-hip x86_64 6.4.0-3.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 9.4 M rocm-libc++ x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 346 k rocm-libc++-devel x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 904 k rocm-lld x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 1.5 M rocm-llvm x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 13 M rocm-llvm-devel x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 3.8 M rocm-llvm-filesystem x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 22 k rocm-llvm-libs x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 20 M rocm-llvm-static x86_64 19-6.rocm6.4.0.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 30 M rocm-runtime x86_64 6.4.0-1.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 646 k rocm-smi x86_64 6.4.0-2.el10 https_download_copr_fedorainfracloud_org_results_rocm_packagers_sig_cs10_mathlib_preview_centos_stream_10_x86_64 600 k systemtap-sdt-devel x86_64 5.2-2.el10 appstream 76 k systemtap-sdt-dtrace x86_64 5.2-2.el10 appstream 70 k tcl x86_64 1:8.6.13-4.el10 baseos 1.1 M tpm2-tss x86_64 4.1.3-5.el10 baseos 438 k tzdata noarch 2025b-1.el10 baseos 839 k vim-filesystem noarch 2:9.1.083-5.el10 baseos 19 k zlib-ng-compat-devel x86_64 2.2.3-2.el10 appstream 39 k Transaction Summary ======================================================================================================================================================================================= Install 305 Packages Total download size: 298 M Installed size: 1.2 G Downloading Packages: (1/305): rocm-core-6.4.0-1.el10.x86_64.rpm 677 kB/s | 13 kB 00:00 (2/305): rocm-core-devel-6.4.0-1.el10.x86_64.rp 433 kB/s | 13 kB 00:00 (3/305): hipify-6.4.0-1.el10.x86_64.rpm 6.5 MB/s | 505 kB 00:00 (4/305): rocm-clang-devel-19-6.rocm6.4.0.el10.x 144 MB/s | 2.4 MB 00:00 (5/305): rocm-clang-19-6.rocm6.4.0.el10.x86_64. 159 MB/s | 16 MB 00:00 (6/305): rocm-clang-runtime-devel-19-6.rocm6.4. 31 MB/s | 493 kB 00:00 (7/305): rocm-cmake-6.4.0-1.el10.noarch.rpm 1.5 MB/s | 38 kB 00:00 (8/305): rocm-clang-libs-19-6.rocm6.4.0.el10.x8 223 MB/s | 23 MB 00:00 (9/305): rocm-comgr-devel-19-6.rocm6.4.0.el10.x 105 kB/s | 32 kB 00:00 (10/305): rocm-device-libs-19-6.rocm6.4.0.el10. 7.2 MB/s | 490 kB 00:00 (11/305): rocm-hip-6.4.0-3.el10.x86_64.rpm 219 MB/s | 9.4 MB 00:00 (12/305): rocm-hip-devel-6.4.0-3.el10.x86_64.rp 17 MB/s | 233 kB 00:00 (13/305): rocm-libc++-19-6.rocm6.4.0.el10.x86_6 48 MB/s | 346 kB 00:00 (14/305): rocm-libc++-devel-19-6.rocm6.4.0.el10 60 MB/s | 904 kB 00:00 (15/305): rocm-lld-19-6.rocm6.4.0.el10.x86_64.r 20 MB/s | 1.5 MB 00:00 (16/305): rocm-llvm-19-6.rocm6.4.0.el10.x86_64. 244 MB/s | 13 MB 00:00 (17/305): rocm-llvm-devel-19-6.rocm6.4.0.el10.x 142 MB/s | 3.8 MB 00:00 (18/305): rocm-llvm-filesystem-19-6.rocm6.4.0.e 3.8 MB/s | 22 kB 00:00 (19/305): rocm-llvm-libs-19-6.rocm6.4.0.el10.x8 290 MB/s | 20 MB 00:00 (20/305): hipcc-19-6.rocm6.4.0.el10.x86_64.rpm 135 kB/s | 134 kB 00:00 (21/305): rocm-rpm-macros-6.4.0-4.limitedisa.el 677 kB/s | 16 kB 00:00 (22/305): rocm-runtime-6.4.0-1.el10.x86_64.rpm 149 MB/s | 646 kB 00:00 (23/305): rocm-runtime-devel-6.4.0-1.el10.x86_6 28 MB/s | 94 kB 00:00 (24/305): rocm-llvm-static-19-6.rocm6.4.0.el10. 132 MB/s | 30 MB 00:00 (25/305): rocm-smi-devel-6.4.0-2.el10.x86_64.rp 3.0 MB/s | 57 kB 00:00 (26/305): environment-modules-5.3.1-8.el10.x86_ 37 MB/s | 707 kB 00:00 (27/305): expat-2.6.4-1.el10.x86_64.rpm 26 MB/s | 115 kB 00:00 (28/305): gnupg2-2.4.5-2.el10.x86_64.rpm 195 MB/s | 2.7 MB 00:00 (29/305): gnutls-3.8.9-14.el10.x86_64.rpm 158 MB/s | 1.4 MB 00:00 (30/305): groff-base-1.23.0-10.el10.x86_64.rpm 132 MB/s | 1.1 MB 00:00 (31/305): hwdata-0.379-10.4.el10.noarch.rpm 166 MB/s | 1.6 MB 00:00 (32/305): less-661-3.el10.x86_64.rpm 43 MB/s | 191 kB 00:00 (33/305): libassuan-2.5.6-6.el10.x86_64.rpm 17 MB/s | 69 kB 00:00 (34/305): libcbor-0.11.0-3.el10.x86_64.rpm 9.2 MB/s | 34 kB 00:00 (35/305): libedit-3.1-52.20230828cvs.el10.x86_6 22 MB/s | 106 kB 00:00 (36/305): libfido2-1.14.0-7.el10.x86_64.rpm 25 MB/s | 98 kB 00:00 (37/305): libgcrypt-1.11.0-6.el10.x86_64.rpm 113 MB/s | 587 kB 00:00 (38/305): libgpg-error-1.50-2.el10.x86_64.rpm 48 MB/s | 236 kB 00:00 (39/305): libksba-1.6.7-2.el10.x86_64.rpm 39 MB/s | 159 kB 00:00 (40/305): libpciaccess-0.16-16.el10.x86_64.rpm 11 MB/s | 28 kB 00:00 (41/305): libpipeline-1.5.7-7.el10.x86_64.rpm 16 MB/s | 53 kB 00:00 (42/305): make-4.4.1-9.el10.x86_64.rpm 98 MB/s | 587 kB 00:00 (43/305): man-db-2.12.0-8.el10.x86_64.rpm 117 MB/s | 1.3 MB 00:00 (44/305): mpdecimal-2.5.1-12.el10.x86_64.rpm 22 MB/s | 90 kB 00:00 (45/305): ncurses-6.4-14.20240127.el10.x86_64.r 78 MB/s | 423 kB 00:00 (46/305): npth-1.6-21.el10.x86_64.rpm 9.7 MB/s | 26 kB 00:00 (47/305): numactl-libs-2.0.19-1.el10.x86_64.rpm 13 MB/s | 32 kB 00:00 (48/305): openssh-9.9p1-9.el10.x86_64.rpm 80 MB/s | 353 kB 00:00 (49/305): openssh-clients-9.9p1-9.el10.x86_64.r 120 MB/s | 763 kB 00:00 (50/305): perl-Carp-1.54-511.el10.noarch.rpm 11 MB/s | 30 kB 00:00 (51/305): perl-Data-Dumper-2.189-512.el10.x86_6 15 MB/s | 57 kB 00:00 (52/305): perl-Digest-1.20-511.el10.noarch.rpm 8.4 MB/s | 26 kB 00:00 (53/305): perl-Digest-MD5-2.59-6.el10.x86_64.rp 13 MB/s | 37 kB 00:00 (54/305): perl-Encode-3.21-511.el10.x86_64.rpm 168 MB/s | 1.1 MB 00:00 (55/305): perl-Exporter-5.78-511.el10.noarch.rp 7.2 MB/s | 32 kB 00:00 (56/305): perl-File-Path-2.18-511.el10.noarch.r 12 MB/s | 36 kB 00:00 (57/305): perl-File-Temp-0.231.100-512.el10.noa 24 MB/s | 60 kB 00:00 (58/305): perl-Getopt-Long-2.58-3.el10.noarch.r 15 MB/s | 65 kB 00:00 (59/305): perl-HTTP-Tiny-0.088-512.el10.noarch. 8.6 MB/s | 57 kB 00:00 (60/305): perl-IO-Socket-IP-0.42-512.el10.noarc 13 MB/s | 43 kB 00:00 (61/305): perl-IO-Socket-SSL-2.085-3.el10.noarc 43 MB/s | 227 kB 00:00 (62/305): perl-MIME-Base64-3.16-511.el10.x86_64 12 MB/s | 31 kB 00:00 (63/305): perl-Mozilla-CA-20231213-5.el10.noarc 4.1 MB/s | 15 kB 00:00 (64/305): perl-Net-SSLeay-1.94-7.el10.x86_64.rp 86 MB/s | 381 kB 00:00 (65/305): perl-PathTools-3.91-512.el10.x86_64.r 20 MB/s | 85 kB 00:00 (66/305): perl-Pod-Escapes-1.07-511.el10.noarch 8.2 MB/s | 21 kB 00:00 (67/305): perl-Pod-Perldoc-3.28.01-512.el10.noa 20 MB/s | 84 kB 00:00 (68/305): perl-Pod-Simple-3.45-511.el10.noarch. 49 MB/s | 218 kB 00:00 (69/305): perl-Pod-Usage-2.03-511.el10.noarch.r 17 MB/s | 41 kB 00:00 (70/305): perl-Scalar-List-Utils-1.63-511.el10. 18 MB/s | 74 kB 00:00 (71/305): perl-Socket-2.038-511.el10.x86_64.rpm 23 MB/s | 56 kB 00:00 (72/305): perl-Storable-3.32-511.el10.x86_64.rp 20 MB/s | 99 kB 00:00 (73/305): perl-Term-ANSIColor-5.01-512.el10.noa 20 MB/s | 49 kB 00:00 (74/305): perl-Term-Cap-1.18-511.el10.noarch.rp 9.0 MB/s | 23 kB 00:00 (75/305): perl-Text-ParseWords-3.31-511.el10.no 6.9 MB/s | 17 kB 00:00 (76/305): perl-Text-Tabs+Wrap-2024.001-511.el10 7.9 MB/s | 23 kB 00:00 (77/305): perl-Time-Local-1.350-511.el10.noarch 14 MB/s | 35 kB 00:00 (78/305): perl-URI-5.27-3.el10.noarch.rpm 28 MB/s | 134 kB 00:00 (79/305): perl-constant-1.33-512.el10.noarch.rp 9.8 MB/s | 24 kB 00:00 (80/305): perl-libnet-3.15-512.el10.noarch.rpm 30 MB/s | 127 kB 00:00 (81/305): perl-parent-0.241-512.el10.noarch.rpm 6.2 MB/s | 16 kB 00:00 (82/305): perl-podlators-5.01-511.el10.noarch.r 30 MB/s | 124 kB 00:00 (83/305): procps-ng-4.0.4-7.el10.x86_64.rpm 71 MB/s | 349 kB 00:00 (84/305): python3-3.12.10-2.el10.x86_64.rpm 10 MB/s | 30 kB 00:00 (85/305): python3-libs-3.12.10-2.el10.x86_64.rp 221 MB/s | 9.4 MB 00:00 (86/305): python3-pip-wheel-23.3.2-7.el10.noarc 179 MB/s | 1.5 MB 00:00 (87/305): python3-pyparsing-3.1.1-7.el10.noarch 58 MB/s | 269 kB 00:00 (88/305): tcl-8.6.13-4.el10.x86_64.rpm 109 MB/s | 1.1 MB 00:00 (89/305): tpm2-tss-4.1.3-5.el10.x86_64.rpm 69 MB/s | 438 kB 00:00 (90/305): tzdata-2025b-1.el10.noarch.rpm 115 MB/s | 839 kB 00:00 (91/305): vim-filesystem-9.1.083-5.el10.noarch. 8.0 MB/s | 19 kB 00:00 (92/305): annobin-docs-12.93-1.el10.noarch.rpm 24 MB/s | 92 kB 00:00 (93/305): annobin-plugin-gcc-12.93-1.el10.x86_6 143 MB/s | 987 kB 00:00 (94/305): cmake-3.30.5-3.el10.x86_64.rpm 305 MB/s | 12 MB 00:00 (95/305): cmake-data-3.30.5-3.el10.noarch.rpm 225 MB/s | 2.5 MB 00:00 (96/305): cmake-filesystem-3.30.5-3.el10.x86_64 6.4 MB/s | 23 kB 00:00 (97/305): cmake-rpm-macros-3.30.5-3.el10.noarch 7.2 MB/s | 16 kB 00:00 (98/305): cpp-14.2.1-7.el10.x86_64.rpm 319 MB/s | 12 MB 00:00 (99/305): emacs-filesystem-29.4-10.el10.noarch. 3.0 MB/s | 10 kB 00:00 (100/305): gcc-14.2.1-7.el10.x86_64.rpm 278 MB/s | 37 MB 00:00 (101/305): gcc-c++-14.2.1-7.el10.x86_64.rpm 311 MB/s | 15 MB 00:00 (102/305): gcc-plugin-annobin-14.2.1-7.el10.x86 21 MB/s | 60 kB 00:00 (103/305): git-2.47.1-1.el10.x86_64.rpm 20 MB/s | 52 kB 00:00 (104/305): git-core-2.47.1-1.el10.x86_64.rpm 295 MB/s | 4.7 MB 00:00 (105/305): git-core-doc-2.47.1-1.el10.noarch.rp 239 MB/s | 3.1 MB 00:00 (106/305): glibc-devel-2.39-37.el10.x86_64.rpm 119 MB/s | 637 kB 00:00 (107/305): kernel-headers-6.12.0-80.el10.x86_64 210 MB/s | 2.2 MB 00:00 (108/305): libdrm-2.4.123-1.el10.x86_64.rpm 36 MB/s | 163 kB 00:00 (109/305): libdrm-devel-2.4.123-1.el10.x86_64.r 361 kB/s | 181 kB 00:00 (110/305): libmpc-1.3.1-7.el10.x86_64.rpm 18 MB/s | 71 kB 00:00 (111/305): libpciaccess-devel-0.16-16.el10.x86_ 4.8 MB/s | 14 kB 00:00 (112/305): libstdc++-devel-14.2.1-7.el10.x86_64 233 MB/s | 2.8 MB 00:00 (113/305): libuv-1.49.2-2.el10.x86_64.rpm 51 MB/s | 260 kB 00:00 (114/305): libxcrypt-devel-4.4.36-10.el10.x86_6 12 MB/s | 30 kB 00:00 (115/305): perl-5.40.2-513.el10.x86_64.rpm 4.3 MB/s | 11 kB 00:00 (116/305): perl-Algorithm-Diff-1.2010-14.el10.n 15 MB/s | 47 kB 00:00 (117/305): perl-Archive-Tar-3.02-512.el10.noarc 19 MB/s | 72 kB 00:00 (118/305): perl-Archive-Zip-1.68-17.el10.noarch 29 MB/s | 111 kB 00:00 (119/305): perl-Attribute-Handlers-1.03-513.el1 11 MB/s | 29 kB 00:00 (120/305): perl-AutoLoader-5.74-513.el10.noarch 9.3 MB/s | 22 kB 00:00 (121/305): perl-AutoSplit-5.74-513.el10.noarch. 8.6 MB/s | 23 kB 00:00 (122/305): perl-B-1.89-513.el10.x86_64.rpm 42 MB/s | 178 kB 00:00 (123/305): perl-Benchmark-1.25-513.el10.noarch. 11 MB/s | 28 kB 00:00 (124/305): perl-CPAN-2.36-512.el10.noarch.rpm 112 MB/s | 567 kB 00:00 (125/305): perl-CPAN-Meta-2.150010-511.el10.noa 37 MB/s | 198 kB 00:00 (126/305): perl-CPAN-Meta-Requirements-2.143-11 13 MB/s | 36 kB 00:00 (127/305): perl-CPAN-Meta-YAML-0.018-512.el10.n 10 MB/s | 27 kB 00:00 (128/305): perl-Class-Struct-0.68-513.el10.noar 8.8 MB/s | 23 kB 00:00 (129/305): perl-Compress-Bzip2-2.28-21.el10.x86 17 MB/s | 68 kB 00:00 (130/305): perl-Compress-Raw-Bzip2-2.212-512.el 14 MB/s | 36 kB 00:00 (131/305): perl-Compress-Raw-Lzma-2.212-3.el10. 19 MB/s | 52 kB 00:00 (132/305): perl-Compress-Raw-Zlib-2.212-512.el1 15 MB/s | 66 kB 00:00 (133/305): perl-Config-Extensions-0.03-513.el10 5.1 MB/s | 13 kB 00:00 (134/305): perl-Config-Perl-V-0.36-512.el10.noa 7.9 MB/s | 23 kB 00:00 (135/305): perl-DBM_Filter-0.06-513.el10.noarch 11 MB/s | 28 kB 00:00 (136/305): perl-Data-OptList-0.114-7.el10.noarc 11 MB/s | 28 kB 00:00 (137/305): perl-Data-Section-0.200008-8.el10.no 11 MB/s | 26 kB 00:00 (138/305): perl-Devel-PPPort-3.72-512.el10.x86_ 43 MB/s | 220 kB 00:00 (139/305): perl-Devel-Peek-1.34-513.el10.x86_64 12 MB/s | 33 kB 00:00 (140/305): perl-Devel-SelfStubber-1.06-513.el10 6.0 MB/s | 15 kB 00:00 (141/305): perl-Devel-Size-0.84-3.el10.x86_64.r 13 MB/s | 32 kB 00:00 (142/305): perl-Digest-SHA-6.04-512.el10.x86_64 24 MB/s | 63 kB 00:00 (143/305): perl-DirHandle-1.05-513.el10.noarch. 5.4 MB/s | 13 kB 00:00 (144/305): perl-Dumpvalue-2.27-513.el10.noarch. 4.9 MB/s | 19 kB 00:00 (145/305): perl-DynaLoader-1.56-513.el10.x86_64 11 MB/s | 27 kB 00:00 (146/305): perl-Encode-devel-3.21-511.el10.x86_ 15 MB/s | 42 kB 00:00 (147/305): perl-English-1.11-513.el10.noarch.rp 5.9 MB/s | 15 kB 00:00 (148/305): perl-Env-1.06-511.el10.noarch.rpm 6.6 MB/s | 21 kB 00:00 (149/305): perl-Errno-1.38-513.el10.x86_64.rpm 6.7 MB/s | 16 kB 00:00 (150/305): perl-Error-0.17029-18.el10.noarch.rp 15 MB/s | 42 kB 00:00 (151/305): perl-ExtUtils-CBuilder-0.280240-511. 17 MB/s | 50 kB 00:00 (152/305): perl-ExtUtils-Command-7.70-513.el10. 6.1 MB/s | 15 kB 00:00 (153/305): perl-ExtUtils-Constant-0.25-513.el10 16 MB/s | 45 kB 00:00 (154/305): perl-ExtUtils-Embed-1.35-513.el10.no 7.5 MB/s | 19 kB 00:00 (155/305): perl-ExtUtils-Install-2.22-511.el10. 18 MB/s | 44 kB 00:00 (156/305): perl-ExtUtils-MM-Utils-7.70-513.el10 5.2 MB/s | 13 kB 00:00 (157/305): perl-ExtUtils-MakeMaker-7.70-513.el1 60 MB/s | 293 kB 00:00 (158/305): perl-ExtUtils-Manifest-1.75-511.el10 5.7 MB/s | 35 kB 00:00 (159/305): perl-ExtUtils-Miniperl-1.14-513.el10 5.9 MB/s | 16 kB 00:00 (160/305): perl-ExtUtils-ParseXS-3.51-512.el10. 39 MB/s | 186 kB 00:00 (161/305): perl-Fcntl-1.18-513.el10.x86_64.rpm 12 MB/s | 31 kB 00:00 (162/305): perl-File-Basename-2.86-513.el10.noa 7.4 MB/s | 18 kB 00:00 (163/305): perl-File-Compare-1.100.800-513.el10 5.8 MB/s | 14 kB 00:00 (164/305): perl-File-Copy-2.41-513.el10.noarch. 5.1 MB/s | 21 kB 00:00 (165/305): perl-File-DosGlob-1.12-513.el10.x86_ 8.2 MB/s | 21 kB 00:00 (166/305): perl-File-Fetch-1.04-511.el10.noarch 12 MB/s | 31 kB 00:00 (167/305): perl-File-Find-1.44-513.el10.noarch. 11 MB/s | 26 kB 00:00 (168/305): perl-File-HomeDir-1.006-15.el10.noar 23 MB/s | 58 kB 00:00 (169/305): perl-File-Which-1.27-14.el10.noarch. 9.5 MB/s | 23 kB 00:00 (170/305): perl-File-stat-1.14-513.el10.noarch. 7.2 MB/s | 18 kB 00:00 (171/305): perl-FileCache-1.10-513.el10.noarch. 6.4 MB/s | 16 kB 00:00 (172/305): perl-FileHandle-2.05-513.el10.noarch 6.2 MB/s | 16 kB 00:00 (173/305): perl-Filter-1.64-512.el10.x86_64.rpm 21 MB/s | 89 kB 00:00 (174/305): perl-Filter-Simple-0.96-511.el10.noa 11 MB/s | 28 kB 00:00 (175/305): perl-FindBin-1.54-513.el10.noarch.rp 5.7 MB/s | 15 kB 00:00 (176/305): perl-GDBM_File-1.24-513.el10.x86_64. 17 MB/s | 43 kB 00:00 (177/305): perl-Getopt-Std-1.14-513.el10.noarch 7.6 MB/s | 17 kB 00:00 (178/305): perl-Git-2.47.1-1.el10.noarch.rpm 16 MB/s | 39 kB 00:00 (179/305): perl-Hash-Util-0.32-513.el10.x86_64. 15 MB/s | 35 kB 00:00 (180/305): perl-Hash-Util-FieldHash-1.27-513.el 13 MB/s | 40 kB 00:00 (181/305): perl-I18N-Collate-1.02-513.el10.noar 6.6 MB/s | 15 kB 00:00 (182/305): perl-I18N-LangTags-0.45-513.el10.noa 22 MB/s | 53 kB 00:00 (183/305): perl-I18N-Langinfo-0.24-513.el10.x86 11 MB/s | 27 kB 00:00 (184/305): perl-IO-1.55-513.el10.x86_64.rpm 19 MB/s | 81 kB 00:00 (185/305): perl-IO-Compress-2.212-512.el10.noar 62 MB/s | 306 kB 00:00 (186/305): perl-IO-Compress-Lzma-2.206-7.el10.n 19 MB/s | 78 kB 00:00 (187/305): perl-IO-Zlib-1.15-511.el10.noarch.rp 8.5 MB/s | 21 kB 00:00 (188/305): perl-IPC-Cmd-1.04-512.el10.noarch.rp 16 MB/s | 41 kB 00:00 (189/305): perl-IPC-Open3-1.22-513.el10.noarch. 9.3 MB/s | 23 kB 00:00 (190/305): perl-IPC-SysV-2.09-512.el10.x86_64.r 18 MB/s | 42 kB 00:00 (191/305): perl-IPC-System-Simple-1.30-16.el10. 15 MB/s | 40 kB 00:00 (192/305): perl-JSON-PP-4.16-512.el10.noarch.rp 18 MB/s | 67 kB 00:00 (193/305): perl-Locale-Maketext-1.33-512.el10.n 24 MB/s | 94 kB 00:00 (194/305): perl-Locale-Maketext-Simple-0.21-513 7.1 MB/s | 19 kB 00:00 (195/305): perl-MRO-Compat-0.15-12.el10.noarch. 11 MB/s | 26 kB 00:00 (196/305): perl-Math-BigInt-2.0030.03-3.el10.no 53 MB/s | 228 kB 00:00 (197/305): perl-Math-BigInt-FastCalc-0.501.800- 11 MB/s | 29 kB 00:00 (198/305): perl-Math-Complex-1.62-513.el10.noar 17 MB/s | 47 kB 00:00 (199/305): perl-Memoize-1.16-513.el10.noarch.rp 11 MB/s | 47 kB 00:00 (200/305): perl-Module-Build-0.42.34-7.el10.noa 54 MB/s | 251 kB 00:00 (201/305): perl-Module-CoreList-5.20250120-1.el 22 MB/s | 92 kB 00:00 (202/305): perl-Module-CoreList-tools-5.2025012 6.4 MB/s | 19 kB 00:00 (203/305): perl-Module-Load-0.36-511.el10.noarc 7.6 MB/s | 18 kB 00:00 (204/305): perl-Module-Load-Conditional-0.74-51 8.3 MB/s | 23 kB 00:00 (205/305): perl-Module-Loaded-0.08-513.el10.noa 6.0 MB/s | 14 kB 00:00 (206/305): perl-Module-Metadata-1.000038-512.el 15 MB/s | 36 kB 00:00 (207/305): perl-Module-Signature-0.88-12.el10.n 20 MB/s | 83 kB 00:00 (208/305): perl-NDBM_File-1.17-513.el10.x86_64. 9.2 MB/s | 24 kB 00:00 (209/305): perl-NEXT-0.69-513.el10.noarch.rpm 7.8 MB/s | 22 kB 00:00 (210/305): perl-Net-1.04-513.el10.noarch.rpm 11 MB/s | 24 kB 00:00 (211/305): perl-Net-Ping-2.76-511.el10.noarch.r 20 MB/s | 51 kB 00:00 (212/305): perl-ODBM_File-1.18-513.el10.x86_64. 9.0 MB/s | 24 kB 00:00 (213/305): perl-Opcode-1.65-513.el10.x86_64.rpm 13 MB/s | 37 kB 00:00 (214/305): perl-POSIX-2.20-513.el10.x86_64.rpm 21 MB/s | 98 kB 00:00 (215/305): perl-Package-Generator-1.106-34.el10 6.9 MB/s | 23 kB 00:00 (216/305): perl-Params-Check-0.38-511.el10.noar 9.7 MB/s | 23 kB 00:00 (217/305): perl-Params-Util-1.102-19.el10.x86_6 12 MB/s | 34 kB 00:00 (218/305): perl-Perl-OSType-1.010-512.el10.noar 9.6 MB/s | 24 kB 00:00 (219/305): perl-PerlIO-via-QuotedPrint-0.10-511 8.5 MB/s | 23 kB 00:00 (220/305): perl-Pod-Checker-1.77-511.el10.noarc 12 MB/s | 33 kB 00:00 (221/305): perl-Pod-Functions-1.14-513.el10.noa 5.8 MB/s | 16 kB 00:00 (222/305): perl-Pod-Html-1.35-513.el10.noarch.r 9.2 MB/s | 30 kB 00:00 (223/305): perl-Safe-2.46-513.el10.noarch.rpm 8.6 MB/s | 26 kB 00:00 (224/305): perl-Search-Dict-1.07-513.el10.noarc 5.5 MB/s | 14 kB 00:00 (225/305): perl-SelectSaver-1.02-513.el10.noarc 4.5 MB/s | 13 kB 00:00 (226/305): perl-SelfLoader-1.27-513.el10.noarch 6.6 MB/s | 23 kB 00:00 (227/305): perl-Software-License-0.104006-4.el1 38 MB/s | 151 kB 00:00 (228/305): perl-Sub-Exporter-0.991-7.el10.noarc 19 MB/s | 78 kB 00:00 (229/305): perl-Sub-Install-0.929-8.el10.noarch 9.4 MB/s | 24 kB 00:00 (230/305): perl-Symbol-1.09-513.el10.noarch.rpm 5.0 MB/s | 15 kB 00:00 (231/305): perl-Sys-Hostname-1.25-513.el10.x86_ 6.1 MB/s | 18 kB 00:00 (232/305): perl-Sys-Syslog-0.36-512.el10.x86_64 20 MB/s | 48 kB 00:00 (233/305): perl-Term-Complete-1.403-513.el10.no 5.6 MB/s | 14 kB 00:00 (234/305): perl-Term-ReadLine-1.17-513.el10.noa 8.5 MB/s | 20 kB 00:00 (235/305): perl-Term-Table-0.018-513.el10.noarc 19 MB/s | 44 kB 00:00 (236/305): perl-TermReadKey-2.38-24.el10.x86_64 15 MB/s | 37 kB 00:00 (237/305): perl-Test-1.31-513.el10.noarch.rpm 12 MB/s | 30 kB 00:00 (238/305): perl-Test-Harness-3.48-512.el10.noar 66 MB/s | 284 kB 00:00 (239/305): perl-Test-Simple-1.302199-513.el10.n 113 MB/s | 611 kB 00:00 (240/305): perl-Test2-Suite-0.000163-4.el10.noa 77 MB/s | 485 kB 00:00 (241/305): perl-Text-Abbrev-1.02-513.el10.noarc 5.1 MB/s | 13 kB 00:00 (242/305): perl-Text-Balanced-2.06-511.el10.noa 20 MB/s | 50 kB 00:00 (243/305): perl-Text-Diff-1.45-24.el10.noarch.r 16 MB/s | 41 kB 00:00 (244/305): perl-Text-Glob-0.11-26.el10.noarch.r 5.6 MB/s | 14 kB 00:00 (245/305): perl-Text-Template-1.61-8.el10.noarc 20 MB/s | 60 kB 00:00 (246/305): perl-Thread-3.05-513.el10.noarch.rpm 7.8 MB/s | 19 kB 00:00 (247/305): perl-Thread-Queue-3.14-511.el10.noar 8.1 MB/s | 22 kB 00:00 (248/305): perl-Thread-Semaphore-2.13-513.el10. 6.3 MB/s | 17 kB 00:00 (249/305): perl-Tie-4.6-513.el10.noarch.rpm 12 MB/s | 29 kB 00:00 (250/305): perl-Tie-File-1.09-513.el10.noarch.r 16 MB/s | 44 kB 00:00 (251/305): perl-Tie-Memoize-1.1-513.el10.noarch 4.1 MB/s | 15 kB 00:00 (252/305): perl-Tie-RefHash-1.40-511.el10.noarc 8.0 MB/s | 24 kB 00:00 (253/305): perl-Time-1.04-513.el10.noarch.rpm 6.6 MB/s | 18 kB 00:00 (254/305): perl-Time-HiRes-1.9777-511.el10.x86_ 19 MB/s | 59 kB 00:00 (255/305): perl-Time-Piece-1.3401-513.el10.x86_ 16 MB/s | 41 kB 00:00 (256/305): perl-Unicode-Collate-1.31-511.el10.x 113 MB/s | 658 kB 00:00 (257/305): perl-Unicode-Normalize-1.32-511.el10 19 MB/s | 75 kB 00:00 (258/305): perl-Unicode-UCD-0.78-513.el10.noarc 18 MB/s | 79 kB 00:00 (259/305): perl-User-pwent-1.05-513.el10.noarch 8.4 MB/s | 20 kB 00:00 (260/305): perl-autodie-2.37-512.el10.noarch.rp 25 MB/s | 95 kB 00:00 (261/305): perl-autouse-1.11-513.el10.noarch.rp 6.2 MB/s | 15 kB 00:00 (262/305): perl-base-2.27-513.el10.noarch.rpm 7.4 MB/s | 17 kB 00:00 (263/305): perl-bignum-0.67-512.el10.noarch.rpm 18 MB/s | 51 kB 00:00 (264/305): perl-blib-1.07-513.el10.noarch.rpm 4.6 MB/s | 13 kB 00:00 (265/305): perl-debugger-1.60-513.el10.noarch.r 28 MB/s | 134 kB 00:00 (266/305): perl-deprecate-0.04-513.el10.noarch. 5.4 MB/s | 16 kB 00:00 (267/305): perl-devel-5.40.2-513.el10.x86_64.rp 133 MB/s | 768 kB 00:00 (268/305): perl-diagnostics-1.40-513.el10.noarc 50 MB/s | 218 kB 00:00 (269/305): perl-doc-5.40.2-513.el10.noarch.rpm 181 MB/s | 4.9 MB 00:00 (270/305): perl-encoding-3.00-511.el10.x86_64.r 24 MB/s | 64 kB 00:00 (271/305): perl-encoding-warnings-0.14-513.el10 6.6 MB/s | 18 kB 00:00 (272/305): perl-experimental-0.032-511.el10.noa 10 MB/s | 27 kB 00:00 (273/305): perl-fields-2.27-513.el10.noarch.rpm 6.1 MB/s | 17 kB 00:00 (274/305): perl-filetest-1.03-513.el10.noarch.r 6.0 MB/s | 16 kB 00:00 (275/305): perl-if-0.61.000-513.el10.noarch.rpm 5.8 MB/s | 15 kB 00:00 (276/305): perl-inc-latest-0.500-30.el10.noarch 9.5 MB/s | 24 kB 00:00 (277/305): perl-interpreter-5.40.2-513.el10.x86 18 MB/s | 73 kB 00:00 (278/305): perl-less-0.03-513.el10.noarch.rpm 5.4 MB/s | 14 kB 00:00 (279/305): perl-lib-0.65-513.el10.x86_64.rpm 6.6 MB/s | 16 kB 00:00 (280/305): perl-libnetcfg-5.40.2-513.el10.noarc 7.1 MB/s | 17 kB 00:00 (281/305): perl-libs-5.40.2-513.el10.x86_64.rpm 101 MB/s | 2.4 MB 00:00 (282/305): perl-local-lib-2.000029-9.el10.noarc 16 MB/s | 67 kB 00:00 (283/305): perl-locale-1.12-513.el10.noarch.rpm 6.3 MB/s | 15 kB 00:00 (284/305): perl-macros-5.40.2-513.el10.noarch.r 5.6 MB/s | 13 kB 00:00 (285/305): perl-meta-notation-5.40.2-513.el10.n 4.6 MB/s | 12 kB 00:00 (286/305): perl-mro-1.29-513.el10.x86_64.rpm 11 MB/s | 31 kB 00:00 (287/305): perl-open-1.13-513.el10.noarch.rpm 7.2 MB/s | 17 kB 00:00 (288/305): perl-overload-1.37-513.el10.noarch.r 18 MB/s | 46 kB 00:00 (289/305): perl-overloading-0.02-513.el10.noarc 5.9 MB/s | 14 kB 00:00 (290/305): rocm-smi-6.4.0-2.el10.x86_64.rpm 311 kB/s | 600 kB 00:01 (291/305): perl-perlfaq-5.20240218-511.el10.noa 62 MB/s | 376 kB 00:00 (292/305): perl-ph-5.40.2-513.el10.x86_64.rpm 20 MB/s | 47 kB 00:00 (293/305): perl-sigtrap-1.10-513.el10.noarch.rp 6.5 MB/s | 17 kB 00:00 (294/305): perl-sort-2.05-513.el10.noarch.rpm 5.9 MB/s | 14 kB 00:00 (295/305): perl-subs-1.04-513.el10.noarch.rpm 6.0 MB/s | 13 kB 00:00 (296/305): perl-threads-2.40-511.el10.x86_64.rp 24 MB/s | 59 kB 00:00 (297/305): perl-threads-shared-1.69-511.el10.x8 14 MB/s | 46 kB 00:00 (298/305): perl-utils-5.40.2-513.el10.noarch.rp 22 MB/s | 53 kB 00:00 (299/305): perl-vars-1.05-513.el10.noarch.rpm 6.0 MB/s | 14 kB 00:00 (300/305): perl-vmsish-1.04-513.el10.noarch.rpm 7.6 MB/s | 15 kB 00:00 (301/305): perl-version-0.99.32-4.el10.x86_64.r 16 MB/s | 64 kB 00:00 (302/305): systemtap-sdt-devel-5.2-2.el10.x86_6 19 MB/s | 76 kB 00:00 (303/305): systemtap-sdt-dtrace-5.2-2.el10.x86_ 17 MB/s | 70 kB 00:00 (304/305): zlib-ng-compat-devel-2.2.3-2.el10.x8 13 MB/s | 39 kB 00:00 (305/305): rocm-comgr-19-6.rocm6.4.0.el10.x86_6 444 kB/s | 31 MB 01:10 -------------------------------------------------------------------------------- Total 4.2 MB/s | 298 MB 01:10 Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Preparing : 1/1 Installing : cmake-filesystem-3.30.5-3.el10.x86_64 1/305 Installing : libgpg-error-1.50-2.el10.x86_64 2/305 Installing : libmpc-1.3.1-7.el10.x86_64 3/305 Installing : make-1:4.4.1-9.el10.x86_64 4/305 Installing : less-661-3.el10.x86_64 5/305 Installing : expat-2.6.4-1.el10.x86_64 6/305 Installing : kernel-headers-6.12.0-80.el10.x86_64 7/305 Installing : libxcrypt-devel-4.4.36-10.el10.x86_64 8/305 Installing : glibc-devel-2.39-37.el10.x86_64 9/305 Installing : vim-filesystem-2:9.1.083-5.el10.noarch 10/305 Installing : numactl-libs-2.0.19-1.el10.x86_64 11/305 Running scriptlet: groff-base-1.23.0-10.el10.x86_64 12/305 Installing : groff-base-1.23.0-10.el10.x86_64 12/305 Running scriptlet: groff-base-1.23.0-10.el10.x86_64 12/305 Installing : rocm-llvm-filesystem-19-6.rocm6.4.0.el10.x86_64 13/305 Installing : rocm-libc++-19-6.rocm6.4.0.el10.x86_64 14/305 Installing : rocm-llvm-libs-19-6.rocm6.4.0.el10.x86_64 15/305 Installing : rocm-clang-libs-19-6.rocm6.4.0.el10.x86_64 16/305 Installing : rocm-comgr-19-6.rocm6.4.0.el10.x86_64 17/305 Installing : rocm-lld-19-6.rocm6.4.0.el10.x86_64 18/305 Installing : rocm-libc++-devel-19-6.rocm6.4.0.el10.x86_64 19/305 Installing : cpp-14.2.1-7.el10.x86_64 20/305 Installing : gcc-14.2.1-7.el10.x86_64 21/305 Running scriptlet: gcc-14.2.1-7.el10.x86_64 21/305 Installing : libassuan-2.5.6-6.el10.x86_64 22/305 Installing : libgcrypt-1.11.0-6.el10.x86_64 23/305 Installing : libksba-1.6.7-2.el10.x86_64 24/305 Installing : zlib-ng-compat-devel-2.2.3-2.el10.x86_64 25/305 Installing : libuv-1:1.49.2-2.el10.x86_64 26/305 Installing : libstdc++-devel-14.2.1-7.el10.x86_64 27/305 Installing : gcc-c++-14.2.1-7.el10.x86_64 28/305 Installing : emacs-filesystem-1:29.4-10.el10.noarch 29/305 Installing : annobin-docs-12.93-1.el10.noarch 30/305 Installing : tzdata-2025b-1.el10.noarch 31/305 Running scriptlet: tpm2-tss-4.1.3-5.el10.x86_64 32/305 Installing : tpm2-tss-4.1.3-5.el10.x86_64 32/305 Installing : tcl-1:8.6.13-4.el10.x86_64 33/305 Installing : python3-pip-wheel-23.3.2-7.el10.noarch 34/305 Installing : procps-ng-4.0.4-7.el10.x86_64 35/305 Installing : openssh-9.9p1-9.el10.x86_64 36/305 Installing : npth-1.6-21.el10.x86_64 37/305 Installing : ncurses-6.4-14.20240127.el10.x86_64 38/305 Installing : perl-Digest-1.20-511.el10.noarch 39/305 Installing : perl-Digest-MD5-2.59-6.el10.x86_64 40/305 Installing : perl-B-1.89-513.el10.x86_64 41/305 Installing : perl-FileHandle-2.05-513.el10.noarch 42/305 Installing : perl-Data-Dumper-2.189-512.el10.x86_64 43/305 Installing : perl-libnet-3.15-512.el10.noarch 44/305 Installing : perl-AutoLoader-5.74-513.el10.noarch 45/305 Installing : perl-Class-Struct-0.68-513.el10.noarch 46/305 Installing : perl-locale-1.12-513.el10.noarch 47/305 Installing : perl-URI-5.27-3.el10.noarch 48/305 Installing : perl-Socket-4:2.038-511.el10.x86_64 49/305 Installing : perl-Term-ANSIColor-5.01-512.el10.noarch 50/305 Installing : perl-Text-Tabs+Wrap-2024.001-511.el10.noarch 51/305 Installing : perl-Time-Local-2:1.350-511.el10.noarch 52/305 Installing : perl-Mozilla-CA-20231213-5.el10.noarch 53/305 Installing : perl-SelectSaver-1.02-513.el10.noarch 54/305 Installing : perl-base-2.27-513.el10.noarch 55/305 Installing : perl-if-0.61.000-513.el10.noarch 56/305 Installing : perl-POSIX-2.20-513.el10.x86_64 57/305 Installing : perl-mro-1.29-513.el10.x86_64 58/305 Installing : perl-IO-Socket-IP-0.42-512.el10.noarch 59/305 Installing : perl-File-stat-1.14-513.el10.noarch 60/305 Installing : perl-overloading-0.02-513.el10.noarch 61/305 Installing : perl-File-Path-2.18-511.el10.noarch 62/305 Installing : perl-IO-Socket-SSL-2.085-3.el10.noarch 63/305 Installing : perl-Net-SSLeay-1.94-7.el10.x86_64 64/305 Installing : perl-Pod-Escapes-1:1.07-511.el10.noarch 65/305 Installing : perl-Term-Cap-1.18-511.el10.noarch 66/305 Installing : perl-Pod-Usage-4:2.03-511.el10.noarch 67/305 Installing : perl-MIME-Base64-3.16-511.el10.x86_64 68/305 Installing : perl-Scalar-List-Utils-5:1.63-511.el10.x86_64 69/305 Installing : perl-Text-ParseWords-3.31-511.el10.noarch 70/305 Installing : perl-constant-1.33-512.el10.noarch 71/305 Installing : perl-parent-1:0.241-512.el10.noarch 72/305 Installing : perl-Errno-1.38-513.el10.x86_64 73/305 Installing : perl-Fcntl-1.18-513.el10.x86_64 74/305 Installing : perl-File-Basename-2.86-513.el10.noarch 75/305 Installing : perl-Getopt-Std-1.14-513.el10.noarch 76/305 Installing : perl-Symbol-1.09-513.el10.noarch 77/305 Installing : perl-Storable-1:3.32-511.el10.x86_64 78/305 Installing : perl-IPC-Open3-1.22-513.el10.noarch 79/305 Installing : perl-IO-1.55-513.el10.x86_64 80/305 Installing : perl-overload-1.37-513.el10.noarch 81/305 Installing : perl-File-Temp-1:0.231.100-512.el10.noarch 82/305 Installing : perl-HTTP-Tiny-0.088-512.el10.noarch 83/305 Installing : perl-vars-1.05-513.el10.noarch 84/305 Installing : perl-Pod-Simple-1:3.45-511.el10.noarch 85/305 Installing : perl-Getopt-Long-1:2.58-3.el10.noarch 86/305 Installing : perl-podlators-1:5.01-511.el10.noarch 87/305 Installing : perl-Carp-1.54-511.el10.noarch 88/305 Installing : perl-Exporter-5.78-511.el10.noarch 89/305 Installing : perl-PathTools-3.91-512.el10.x86_64 90/305 Installing : perl-GDBM_File-1:1.24-513.el10.x86_64 91/305 Installing : perl-NDBM_File-1.17-513.el10.x86_64 92/305 Installing : perl-ODBM_File-1.18-513.el10.x86_64 93/305 Installing : perl-DynaLoader-1.56-513.el10.x86_64 94/305 Installing : perl-Encode-4:3.21-511.el10.x86_64 95/305 Installing : perl-libs-4:5.40.2-513.el10.x86_64 96/305 Installing : perl-Pod-Perldoc-3.28.01-512.el10.noarch 97/305 Installing : perl-doc-5.40.2-513.el10.noarch 98/305 Installing : perl-interpreter-4:5.40.2-513.el10.x86_64 99/305 Installing : perl-File-Find-1.44-513.el10.noarch 100/305 Installing : perl-version-8:0.99.32-4.el10.x86_64 101/305 Installing : perl-File-Copy-2.41-513.el10.noarch 102/305 Installing : perl-Time-HiRes-4:1.9777-511.el10.x86_64 103/305 Installing : perl-ExtUtils-Manifest-1:1.75-511.el10.noarch 104/305 Installing : perl-lib-0.65-513.el10.x86_64 105/305 Installing : perl-threads-1:2.40-511.el10.x86_64 106/305 Installing : perl-threads-shared-1.69-511.el10.x86_64 107/305 Installing : perl-CPAN-Meta-Requirements-2.143-11.el10.noarch 108/305 Installing : perl-ExtUtils-ParseXS-1:3.51-512.el10.noarch 109/305 Installing : perl-Compress-Raw-Zlib-2.212-512.el10.x86_64 110/305 Installing : perl-File-Compare-1.100.800-513.el10.noarch 111/305 Installing : perl-Module-CoreList-1:5.20250120-1.el10.noarch 112/305 Installing : perl-Module-Metadata-1.000038-512.el10.noarch 113/305 Installing : perl-Digest-SHA-1:6.04-512.el10.x86_64 114/305 Installing : perl-Devel-Peek-1.34-513.el10.x86_64 115/305 Installing : perl-Filter-2:1.64-512.el10.x86_64 116/305 Installing : perl-Module-Load-1:0.36-511.el10.noarch 117/305 Installing : perl-Perl-OSType-1.010-512.el10.noarch 118/305 Installing : perl-Term-ReadLine-1.17-513.el10.noarch 119/305 Installing : perl-TermReadKey-2.38-24.el10.x86_64 120/305 Installing : perl-Term-Table-0.018-513.el10.noarch 121/305 Installing : perl-Tie-4.6-513.el10.noarch 122/305 Installing : perl-Unicode-Normalize-1.32-511.el10.x86_64 123/305 Installing : perl-meta-notation-5.40.2-513.el10.noarch 124/305 Installing : perl-encoding-4:3.00-511.el10.x86_64 125/305 Installing : perl-Dumpvalue-2.27-513.el10.noarch 126/305 Installing : perl-Net-Ping-2.76-511.el10.noarch 127/305 Installing : perl-ExtUtils-Command-2:7.70-513.el10.noarch 128/305 Installing : perl-Pod-Html-1.35-513.el10.noarch 129/305 Installing : perl-AutoSplit-5.74-513.el10.noarch 130/305 Installing : perl-Benchmark-1.25-513.el10.noarch 131/305 Installing : perl-Test-Harness-1:3.48-512.el10.noarch 132/305 Installing : perl-CPAN-Meta-YAML-0.018-512.el10.noarch 133/305 Installing : perl-Compress-Raw-Bzip2-2.212-512.el10.x86_64 134/305 Installing : perl-IO-Compress-2.212-512.el10.noarch 135/305 Installing : perl-IO-Zlib-1:1.15-511.el10.noarch 136/305 Installing : perl-Devel-PPPort-3.72-512.el10.x86_64 137/305 Installing : perl-DirHandle-1.05-513.el10.noarch 138/305 Installing : perl-ExtUtils-Constant-0.25-513.el10.noarch 139/305 Installing : perl-ExtUtils-MM-Utils-2:7.70-513.el10.noarch 140/305 Installing : perl-Hash-Util-FieldHash-1.27-513.el10.x86_64 141/305 Installing : perl-Hash-Util-0.32-513.el10.x86_64 142/305 Installing : perl-I18N-LangTags-0.45-513.el10.noarch 143/305 Installing : perl-Locale-Maketext-1.33-512.el10.noarch 144/305 Installing : perl-Locale-Maketext-Simple-1:0.21-513.el10.noar 145/305 Installing : perl-Params-Check-1:0.38-511.el10.noarch 146/305 Installing : perl-Module-Load-Conditional-0.74-511.el10.noarc 147/305 Installing : perl-IPC-Cmd-2:1.04-512.el10.noarch 148/305 Installing : perl-Math-Complex-1.62-513.el10.noarch 149/305 Installing : perl-Math-BigInt-1:2.0030.03-3.el10.noarch 150/305 Installing : perl-JSON-PP-1:4.16-512.el10.noarch 151/305 Installing : perl-CPAN-Meta-2.150010-511.el10.noarch 152/305 Installing : perl-Test-Simple-3:1.302199-513.el10.noarch 153/305 Installing : perl-Params-Util-1.102-19.el10.x86_64 154/305 Installing : perl-SelfLoader-1.27-513.el10.noarch 155/305 Installing : perl-Sub-Install-0.929-8.el10.noarch 156/305 Installing : perl-Sys-Hostname-1.25-513.el10.x86_64 157/305 Installing : perl-Text-Balanced-2.06-511.el10.noarch 158/305 Installing : perl-Tie-RefHash-1.40-511.el10.noarch 159/305 Installing : perl-User-pwent-1.05-513.el10.noarch 160/305 Installing : perl-autouse-1.11-513.el10.noarch 161/305 Installing : perl-subs-1.04-513.el10.noarch 162/305 Installing : perl-Opcode-1.65-513.el10.x86_64 163/305 Installing : perl-Safe-2.46-513.el10.noarch 164/305 Installing : perl-Filter-Simple-0.96-511.el10.noarch 165/305 Installing : perl-Data-OptList-0.114-7.el10.noarch 166/305 Installing : perl-Devel-SelfStubber-1.06-513.el10.noarch 167/305 Installing : perl-Test2-Suite-0.000163-4.el10.noarch 168/305 Installing : perl-Math-BigInt-FastCalc-0.501.800-511.el10.x86 169/305 Installing : perl-bignum-0.67-512.el10.noarch 170/305 Installing : perl-fields-2.27-513.el10.noarch 171/305 Installing : perl-DBM_Filter-0.06-513.el10.noarch 172/305 Installing : perl-open-1.13-513.el10.noarch 173/305 Installing : perl-debugger-1.60-513.el10.noarch 174/305 Installing : perl-sigtrap-1.10-513.el10.noarch 175/305 Installing : perl-Unicode-Collate-1.31-511.el10.x86_64 176/305 Installing : perl-Unicode-UCD-0.78-513.el10.noarch 177/305 Installing : perl-Env-1.06-511.el10.noarch 178/305 Installing : perl-Module-CoreList-tools-1:5.20250120-1.el10.n 179/305 Installing : perl-Archive-Zip-1.68-17.el10.noarch 180/305 Installing : perl-Thread-3.05-513.el10.noarch 181/305 Installing : perl-Thread-Queue-3.14-511.el10.noarch 182/305 Installing : perl-Thread-Semaphore-2.13-513.el10.noarch 183/305 Installing : perl-experimental-0.032-511.el10.noarch 184/305 Installing : perl-Pod-Checker-4:1.77-511.el10.noarch 185/305 Installing : perl-diagnostics-1.40-513.el10.noarch 186/305 Installing : perl-macros-4:5.40.2-513.el10.noarch 187/305 Installing : perl-utils-5.40.2-513.el10.noarch 188/305 Installing : perl-Algorithm-Diff-1.2010-14.el10.noarch 189/305 Installing : perl-Text-Diff-1.45-24.el10.noarch 190/305 Installing : perl-Attribute-Handlers-1.03-513.el10.noarch 191/305 Installing : perl-Compress-Bzip2-2.28-21.el10.x86_64 192/305 Installing : perl-Compress-Raw-Lzma-2.212-3.el10.x86_64 193/305 Installing : perl-IO-Compress-Lzma-2.206-7.el10.noarch 194/305 Installing : perl-Archive-Tar-3.02-512.el10.noarch 195/305 Installing : perl-Config-Extensions-0.03-513.el10.noarch 196/305 Installing : perl-Config-Perl-V-0.36-512.el10.noarch 197/305 Installing : perl-Devel-Size-0.84-3.el10.x86_64 198/305 Installing : perl-English-1.11-513.el10.noarch 199/305 Installing : perl-Error-1:0.17029-18.el10.noarch 200/305 Installing : perl-File-DosGlob-1.12-513.el10.x86_64 201/305 Installing : perl-File-Which-1.27-14.el10.noarch 202/305 Installing : perl-File-HomeDir-1.006-15.el10.noarch 203/305 Installing : perl-FileCache-1.10-513.el10.noarch 204/305 Installing : perl-FindBin-1.54-513.el10.noarch 205/305 Installing : perl-I18N-Collate-1.02-513.el10.noarch 206/305 Installing : perl-I18N-Langinfo-0.24-513.el10.x86_64 207/305 Installing : perl-IPC-SysV-2.09-512.el10.x86_64 208/305 Installing : perl-IPC-System-Simple-1.30-16.el10.noarch 209/305 Installing : perl-autodie-2.37-512.el10.noarch 210/305 Installing : perl-MRO-Compat-0.15-12.el10.noarch 211/305 Installing : perl-Memoize-1.16-513.el10.noarch 212/305 Installing : perl-Module-Loaded-1:0.08-513.el10.noarch 213/305 Installing : perl-NEXT-0.69-513.el10.noarch 214/305 Installing : perl-Net-1.04-513.el10.noarch 215/305 Installing : perl-Package-Generator-1.106-34.el10.noarch 216/305 Installing : perl-Sub-Exporter-0.991-7.el10.noarch 217/305 Installing : perl-Data-Section-0.200008-8.el10.noarch 218/305 Installing : perl-PerlIO-via-QuotedPrint-0.10-511.el10.noarch 219/305 Installing : perl-Pod-Functions-1.14-513.el10.noarch 220/305 Installing : perl-Search-Dict-1.07-513.el10.noarch 221/305 Installing : perl-Sys-Syslog-0.36-512.el10.x86_64 222/305 Installing : perl-Term-Complete-1.403-513.el10.noarch 223/305 Installing : perl-Test-1.31-513.el10.noarch 224/305 Installing : perl-Text-Abbrev-1.02-513.el10.noarch 225/305 Installing : perl-Text-Glob-0.11-26.el10.noarch 226/305 Installing : perl-Text-Template-1.61-8.el10.noarch 227/305 Installing : perl-Software-License-0.104006-4.el10.noarch 228/305 Installing : perl-Tie-File-1.09-513.el10.noarch 229/305 Installing : perl-Tie-Memoize-1.1-513.el10.noarch 230/305 Installing : perl-Time-1.04-513.el10.noarch 231/305 Installing : perl-Time-Piece-1.3401-513.el10.x86_64 232/305 Installing : perl-blib-1.07-513.el10.noarch 233/305 Installing : perl-deprecate-0.04-513.el10.noarch 234/305 Installing : perl-encoding-warnings-0.14-513.el10.noarch 235/305 Installing : perl-filetest-1.03-513.el10.noarch 236/305 Installing : perl-less-0.03-513.el10.noarch 237/305 Installing : perl-local-lib-2.000029-9.el10.noarch 238/305 Installing : perl-perlfaq-5.20240218-511.el10.noarch 239/305 Installing : perl-ph-5.40.2-513.el10.x86_64 240/305 Installing : perl-sort-2.05-513.el10.noarch 241/305 Installing : perl-vmsish-1.04-513.el10.noarch 242/305 Installing : mpdecimal-2.5.1-12.el10.x86_64 243/305 Installing : python3-3.12.10-2.el10.x86_64 244/305 Installing : python3-libs-3.12.10-2.el10.x86_64 245/305 Installing : cmake-rpm-macros-3.30.5-3.el10.noarch 246/305 Installing : cmake-data-3.30.5-3.el10.noarch 247/305 Installing : cmake-3.30.5-3.el10.x86_64 248/305 Installing : python3-pyparsing-3.1.1-7.el10.noarch 249/305 Installing : systemtap-sdt-dtrace-5.2-2.el10.x86_64 250/305 Installing : systemtap-sdt-devel-5.2-2.el10.x86_64 251/305 Installing : perl-ExtUtils-Install-2.22-511.el10.noarch 252/305 Installing : perl-devel-4:5.40.2-513.el10.x86_64 253/305 Installing : perl-ExtUtils-MakeMaker-2:7.70-513.el10.noarch 254/305 Installing : perl-ExtUtils-CBuilder-1:0.280240-511.el10.noarc 255/305 Installing : perl-ExtUtils-Embed-1.35-513.el10.noarch 256/305 Installing : perl-ExtUtils-Miniperl-1.14-513.el10.noarch 257/305 Installing : perl-libnetcfg-4:5.40.2-513.el10.noarch 258/305 Installing : perl-Encode-devel-4:3.21-511.el10.x86_64 259/305 Installing : perl-inc-latest-2:0.500-30.el10.noarch 260/305 Installing : perl-Module-Build-2:0.42.34-7.el10.noarch 261/305 Installing : libpipeline-1.5.7-7.el10.x86_64 262/305 Running scriptlet: man-db-2.12.0-8.el10.x86_64 263/305 Installing : man-db-2.12.0-8.el10.x86_64 263/305 Running scriptlet: man-db-2.12.0-8.el10.x86_64 263/305 Installing : environment-modules-5.3.1-8.el10.x86_64 264/305 Running scriptlet: environment-modules-5.3.1-8.el10.x86_64 264/305 Installing : libedit-3.1-52.20230828cvs.el10.x86_64 265/305 Installing : libcbor-0.11.0-3.el10.x86_64 266/305 Installing : libfido2-1.14.0-7.el10.x86_64 267/305 Installing : openssh-clients-9.9p1-9.el10.x86_64 268/305 Running scriptlet: openssh-clients-9.9p1-9.el10.x86_64 268/305 Installing : git-core-2.47.1-1.el10.x86_64 269/305 Installing : git-core-doc-2.47.1-1.el10.noarch 270/305 Installing : perl-Git-2.47.1-1.el10.noarch 271/305 Installing : git-2.47.1-1.el10.x86_64 272/305 Installing : perl-File-Fetch-1.04-511.el10.noarch 273/305 Installing : hwdata-0.379-10.4.el10.noarch 274/305 Installing : libpciaccess-0.16-16.el10.x86_64 275/305 Installing : libdrm-2.4.123-1.el10.x86_64 276/305 Installing : rocm-runtime-6.4.0-1.el10.x86_64 277/305 Installing : rocm-runtime-devel-6.4.0-1.el10.x86_64 278/305 Installing : rocm-llvm-19-6.rocm6.4.0.el10.x86_64 279/305 Installing : rocm-llvm-devel-19-6.rocm6.4.0.el10.x86_64 280/305 Installing : rocm-llvm-static-19-6.rocm6.4.0.el10.x86_64 281/305 Installing : rocm-smi-6.4.0-2.el10.x86_64 282/305 Installing : libpciaccess-devel-0.16-16.el10.x86_64 283/305 Installing : libdrm-devel-2.4.123-1.el10.x86_64 284/305 Installing : gnutls-3.8.9-14.el10.x86_64 285/305 Installing : gnupg2-2.4.5-2.el10.x86_64 286/305 Installing : perl-Module-Signature-0.88-12.el10.noarch 287/305 Installing : perl-CPAN-2.36-512.el10.noarch 288/305 Installing : perl-4:5.40.2-513.el10.x86_64 289/305 Installing : rocm-clang-runtime-devel-19-6.rocm6.4.0.el10.x86 290/305 Installing : rocm-clang-19-6.rocm6.4.0.el10.x86_64 291/305 Installing : rocm-clang-devel-19-6.rocm6.4.0.el10.x86_64 292/305 Installing : rocm-device-libs-19-6.rocm6.4.0.el10.x86_64 293/305 Installing : hipcc-19-6.rocm6.4.0.el10.x86_64 294/305 Installing : rocm-hip-6.4.0-3.el10.x86_64 295/305 Installing : rocm-comgr-devel-19-6.rocm6.4.0.el10.x86_64 296/305 Installing : rocm-core-6.4.0-1.el10.x86_64 297/305 Installing : rocm-core-devel-6.4.0-1.el10.x86_64 298/305 Installing : rocm-hip-devel-6.4.0-3.el10.x86_64 299/305 Installing : hipify-6.4.0-1.el10.x86_64 300/305 Installing : rocm-smi-devel-6.4.0-2.el10.x86_64 301/305 Installing : rocm-rpm-macros-6.4.0-4.limitedisa.el10.noarch 302/305 Installing : rocm-cmake-6.4.0-1.el10.noarch 303/305 Installing : annobin-plugin-gcc-12.93-1.el10.x86_64 304/305 Running scriptlet: annobin-plugin-gcc-12.93-1.el10.x86_64 304/305 Installing : gcc-plugin-annobin-14.2.1-7.el10.x86_64 305/305 Running scriptlet: gcc-plugin-annobin-14.2.1-7.el10.x86_64 305/305 Installed: annobin-docs-12.93-1.el10.noarch annobin-plugin-gcc-12.93-1.el10.x86_64 cmake-3.30.5-3.el10.x86_64 cmake-data-3.30.5-3.el10.noarch cmake-filesystem-3.30.5-3.el10.x86_64 cmake-rpm-macros-3.30.5-3.el10.noarch cpp-14.2.1-7.el10.x86_64 emacs-filesystem-1:29.4-10.el10.noarch environment-modules-5.3.1-8.el10.x86_64 expat-2.6.4-1.el10.x86_64 gcc-14.2.1-7.el10.x86_64 gcc-c++-14.2.1-7.el10.x86_64 gcc-plugin-annobin-14.2.1-7.el10.x86_64 git-2.47.1-1.el10.x86_64 git-core-2.47.1-1.el10.x86_64 git-core-doc-2.47.1-1.el10.noarch glibc-devel-2.39-37.el10.x86_64 gnupg2-2.4.5-2.el10.x86_64 gnutls-3.8.9-14.el10.x86_64 groff-base-1.23.0-10.el10.x86_64 hipcc-19-6.rocm6.4.0.el10.x86_64 hipify-6.4.0-1.el10.x86_64 hwdata-0.379-10.4.el10.noarch kernel-headers-6.12.0-80.el10.x86_64 less-661-3.el10.x86_64 libassuan-2.5.6-6.el10.x86_64 libcbor-0.11.0-3.el10.x86_64 libdrm-2.4.123-1.el10.x86_64 libdrm-devel-2.4.123-1.el10.x86_64 libedit-3.1-52.20230828cvs.el10.x86_64 libfido2-1.14.0-7.el10.x86_64 libgcrypt-1.11.0-6.el10.x86_64 libgpg-error-1.50-2.el10.x86_64 libksba-1.6.7-2.el10.x86_64 libmpc-1.3.1-7.el10.x86_64 libpciaccess-0.16-16.el10.x86_64 libpciaccess-devel-0.16-16.el10.x86_64 libpipeline-1.5.7-7.el10.x86_64 libstdc++-devel-14.2.1-7.el10.x86_64 libuv-1:1.49.2-2.el10.x86_64 libxcrypt-devel-4.4.36-10.el10.x86_64 make-1:4.4.1-9.el10.x86_64 man-db-2.12.0-8.el10.x86_64 mpdecimal-2.5.1-12.el10.x86_64 ncurses-6.4-14.20240127.el10.x86_64 npth-1.6-21.el10.x86_64 numactl-libs-2.0.19-1.el10.x86_64 openssh-9.9p1-9.el10.x86_64 openssh-clients-9.9p1-9.el10.x86_64 perl-4:5.40.2-513.el10.x86_64 perl-Algorithm-Diff-1.2010-14.el10.noarch perl-Archive-Tar-3.02-512.el10.noarch perl-Archive-Zip-1.68-17.el10.noarch perl-Attribute-Handlers-1.03-513.el10.noarch perl-AutoLoader-5.74-513.el10.noarch perl-AutoSplit-5.74-513.el10.noarch perl-B-1.89-513.el10.x86_64 perl-Benchmark-1.25-513.el10.noarch perl-CPAN-2.36-512.el10.noarch perl-CPAN-Meta-2.150010-511.el10.noarch perl-CPAN-Meta-Requirements-2.143-11.el10.noarch perl-CPAN-Meta-YAML-0.018-512.el10.noarch perl-Carp-1.54-511.el10.noarch perl-Class-Struct-0.68-513.el10.noarch perl-Compress-Bzip2-2.28-21.el10.x86_64 perl-Compress-Raw-Bzip2-2.212-512.el10.x86_64 perl-Compress-Raw-Lzma-2.212-3.el10.x86_64 perl-Compress-Raw-Zlib-2.212-512.el10.x86_64 perl-Config-Extensions-0.03-513.el10.noarch perl-Config-Perl-V-0.36-512.el10.noarch perl-DBM_Filter-0.06-513.el10.noarch perl-Data-Dumper-2.189-512.el10.x86_64 perl-Data-OptList-0.114-7.el10.noarch perl-Data-Section-0.200008-8.el10.noarch perl-Devel-PPPort-3.72-512.el10.x86_64 perl-Devel-Peek-1.34-513.el10.x86_64 perl-Devel-SelfStubber-1.06-513.el10.noarch perl-Devel-Size-0.84-3.el10.x86_64 perl-Digest-1.20-511.el10.noarch perl-Digest-MD5-2.59-6.el10.x86_64 perl-Digest-SHA-1:6.04-512.el10.x86_64 perl-DirHandle-1.05-513.el10.noarch perl-Dumpvalue-2.27-513.el10.noarch perl-DynaLoader-1.56-513.el10.x86_64 perl-Encode-4:3.21-511.el10.x86_64 perl-Encode-devel-4:3.21-511.el10.x86_64 perl-English-1.11-513.el10.noarch perl-Env-1.06-511.el10.noarch perl-Errno-1.38-513.el10.x86_64 perl-Error-1:0.17029-18.el10.noarch perl-Exporter-5.78-511.el10.noarch perl-ExtUtils-CBuilder-1:0.280240-511.el10.noarch perl-ExtUtils-Command-2:7.70-513.el10.noarch perl-ExtUtils-Constant-0.25-513.el10.noarch perl-ExtUtils-Embed-1.35-513.el10.noarch perl-ExtUtils-Install-2.22-511.el10.noarch perl-ExtUtils-MM-Utils-2:7.70-513.el10.noarch perl-ExtUtils-MakeMaker-2:7.70-513.el10.noarch perl-ExtUtils-Manifest-1:1.75-511.el10.noarch perl-ExtUtils-Miniperl-1.14-513.el10.noarch perl-ExtUtils-ParseXS-1:3.51-512.el10.noarch perl-Fcntl-1.18-513.el10.x86_64 perl-File-Basename-2.86-513.el10.noarch perl-File-Compare-1.100.800-513.el10.noarch perl-File-Copy-2.41-513.el10.noarch perl-File-DosGlob-1.12-513.el10.x86_64 perl-File-Fetch-1.04-511.el10.noarch perl-File-Find-1.44-513.el10.noarch perl-File-HomeDir-1.006-15.el10.noarch perl-File-Path-2.18-511.el10.noarch perl-File-Temp-1:0.231.100-512.el10.noarch perl-File-Which-1.27-14.el10.noarch perl-File-stat-1.14-513.el10.noarch perl-FileCache-1.10-513.el10.noarch perl-FileHandle-2.05-513.el10.noarch perl-Filter-2:1.64-512.el10.x86_64 perl-Filter-Simple-0.96-511.el10.noarch perl-FindBin-1.54-513.el10.noarch perl-GDBM_File-1:1.24-513.el10.x86_64 perl-Getopt-Long-1:2.58-3.el10.noarch perl-Getopt-Std-1.14-513.el10.noarch perl-Git-2.47.1-1.el10.noarch perl-HTTP-Tiny-0.088-512.el10.noarch perl-Hash-Util-0.32-513.el10.x86_64 perl-Hash-Util-FieldHash-1.27-513.el10.x86_64 perl-I18N-Collate-1.02-513.el10.noarch perl-I18N-LangTags-0.45-513.el10.noarch perl-I18N-Langinfo-0.24-513.el10.x86_64 perl-IO-1.55-513.el10.x86_64 perl-IO-Compress-2.212-512.el10.noarch perl-IO-Compress-Lzma-2.206-7.el10.noarch perl-IO-Socket-IP-0.42-512.el10.noarch perl-IO-Socket-SSL-2.085-3.el10.noarch perl-IO-Zlib-1:1.15-511.el10.noarch perl-IPC-Cmd-2:1.04-512.el10.noarch perl-IPC-Open3-1.22-513.el10.noarch perl-IPC-SysV-2.09-512.el10.x86_64 perl-IPC-System-Simple-1.30-16.el10.noarch perl-JSON-PP-1:4.16-512.el10.noarch perl-Locale-Maketext-1.33-512.el10.noarch perl-Locale-Maketext-Simple-1:0.21-513.el10.noarch perl-MIME-Base64-3.16-511.el10.x86_64 perl-MRO-Compat-0.15-12.el10.noarch perl-Math-BigInt-1:2.0030.03-3.el10.noarch perl-Math-BigInt-FastCalc-0.501.800-511.el10.x86_64 perl-Math-Complex-1.62-513.el10.noarch perl-Memoize-1.16-513.el10.noarch perl-Module-Build-2:0.42.34-7.el10.noarch perl-Module-CoreList-1:5.20250120-1.el10.noarch perl-Module-CoreList-tools-1:5.20250120-1.el10.noarch perl-Module-Load-1:0.36-511.el10.noarch perl-Module-Load-Conditional-0.74-511.el10.noarch perl-Module-Loaded-1:0.08-513.el10.noarch perl-Module-Metadata-1.000038-512.el10.noarch perl-Module-Signature-0.88-12.el10.noarch perl-Mozilla-CA-20231213-5.el10.noarch perl-NDBM_File-1.17-513.el10.x86_64 perl-NEXT-0.69-513.el10.noarch perl-Net-1.04-513.el10.noarch perl-Net-Ping-2.76-511.el10.noarch perl-Net-SSLeay-1.94-7.el10.x86_64 perl-ODBM_File-1.18-513.el10.x86_64 perl-Opcode-1.65-513.el10.x86_64 perl-POSIX-2.20-513.el10.x86_64 perl-Package-Generator-1.106-34.el10.noarch perl-Params-Check-1:0.38-511.el10.noarch perl-Params-Util-1.102-19.el10.x86_64 perl-PathTools-3.91-512.el10.x86_64 perl-Perl-OSType-1.010-512.el10.noarch perl-PerlIO-via-QuotedPrint-0.10-511.el10.noarch perl-Pod-Checker-4:1.77-511.el10.noarch perl-Pod-Escapes-1:1.07-511.el10.noarch perl-Pod-Functions-1.14-513.el10.noarch perl-Pod-Html-1.35-513.el10.noarch perl-Pod-Perldoc-3.28.01-512.el10.noarch perl-Pod-Simple-1:3.45-511.el10.noarch perl-Pod-Usage-4:2.03-511.el10.noarch perl-Safe-2.46-513.el10.noarch perl-Scalar-List-Utils-5:1.63-511.el10.x86_64 perl-Search-Dict-1.07-513.el10.noarch perl-SelectSaver-1.02-513.el10.noarch perl-SelfLoader-1.27-513.el10.noarch perl-Socket-4:2.038-511.el10.x86_64 perl-Software-License-0.104006-4.el10.noarch perl-Storable-1:3.32-511.el10.x86_64 perl-Sub-Exporter-0.991-7.el10.noarch perl-Sub-Install-0.929-8.el10.noarch perl-Symbol-1.09-513.el10.noarch perl-Sys-Hostname-1.25-513.el10.x86_64 perl-Sys-Syslog-0.36-512.el10.x86_64 perl-Term-ANSIColor-5.01-512.el10.noarch perl-Term-Cap-1.18-511.el10.noarch perl-Term-Complete-1.403-513.el10.noarch perl-Term-ReadLine-1.17-513.el10.noarch perl-Term-Table-0.018-513.el10.noarch perl-TermReadKey-2.38-24.el10.x86_64 perl-Test-1.31-513.el10.noarch perl-Test-Harness-1:3.48-512.el10.noarch perl-Test-Simple-3:1.302199-513.el10.noarch perl-Test2-Suite-0.000163-4.el10.noarch perl-Text-Abbrev-1.02-513.el10.noarch perl-Text-Balanced-2.06-511.el10.noarch perl-Text-Diff-1.45-24.el10.noarch perl-Text-Glob-0.11-26.el10.noarch perl-Text-ParseWords-3.31-511.el10.noarch perl-Text-Tabs+Wrap-2024.001-511.el10.noarch perl-Text-Template-1.61-8.el10.noarch perl-Thread-3.05-513.el10.noarch perl-Thread-Queue-3.14-511.el10.noarch perl-Thread-Semaphore-2.13-513.el10.noarch perl-Tie-4.6-513.el10.noarch perl-Tie-File-1.09-513.el10.noarch perl-Tie-Memoize-1.1-513.el10.noarch perl-Tie-RefHash-1.40-511.el10.noarch perl-Time-1.04-513.el10.noarch perl-Time-HiRes-4:1.9777-511.el10.x86_64 perl-Time-Local-2:1.350-511.el10.noarch perl-Time-Piece-1.3401-513.el10.x86_64 perl-URI-5.27-3.el10.noarch perl-Unicode-Collate-1.31-511.el10.x86_64 perl-Unicode-Normalize-1.32-511.el10.x86_64 perl-Unicode-UCD-0.78-513.el10.noarch perl-User-pwent-1.05-513.el10.noarch perl-autodie-2.37-512.el10.noarch perl-autouse-1.11-513.el10.noarch perl-base-2.27-513.el10.noarch perl-bignum-0.67-512.el10.noarch perl-blib-1.07-513.el10.noarch perl-constant-1.33-512.el10.noarch perl-debugger-1.60-513.el10.noarch perl-deprecate-0.04-513.el10.noarch perl-devel-4:5.40.2-513.el10.x86_64 perl-diagnostics-1.40-513.el10.noarch perl-doc-5.40.2-513.el10.noarch perl-encoding-4:3.00-511.el10.x86_64 perl-encoding-warnings-0.14-513.el10.noarch perl-experimental-0.032-511.el10.noarch perl-fields-2.27-513.el10.noarch perl-filetest-1.03-513.el10.noarch perl-if-0.61.000-513.el10.noarch perl-inc-latest-2:0.500-30.el10.noarch perl-interpreter-4:5.40.2-513.el10.x86_64 perl-less-0.03-513.el10.noarch perl-lib-0.65-513.el10.x86_64 perl-libnet-3.15-512.el10.noarch perl-libnetcfg-4:5.40.2-513.el10.noarch perl-libs-4:5.40.2-513.el10.x86_64 perl-local-lib-2.000029-9.el10.noarch perl-locale-1.12-513.el10.noarch perl-macros-4:5.40.2-513.el10.noarch perl-meta-notation-5.40.2-513.el10.noarch perl-mro-1.29-513.el10.x86_64 perl-open-1.13-513.el10.noarch perl-overload-1.37-513.el10.noarch perl-overloading-0.02-513.el10.noarch perl-parent-1:0.241-512.el10.noarch perl-perlfaq-5.20240218-511.el10.noarch perl-ph-5.40.2-513.el10.x86_64 perl-podlators-1:5.01-511.el10.noarch perl-sigtrap-1.10-513.el10.noarch perl-sort-2.05-513.el10.noarch perl-subs-1.04-513.el10.noarch perl-threads-1:2.40-511.el10.x86_64 perl-threads-shared-1.69-511.el10.x86_64 perl-utils-5.40.2-513.el10.noarch perl-vars-1.05-513.el10.noarch perl-version-8:0.99.32-4.el10.x86_64 perl-vmsish-1.04-513.el10.noarch procps-ng-4.0.4-7.el10.x86_64 python3-3.12.10-2.el10.x86_64 python3-libs-3.12.10-2.el10.x86_64 python3-pip-wheel-23.3.2-7.el10.noarch python3-pyparsing-3.1.1-7.el10.noarch rocm-clang-19-6.rocm6.4.0.el10.x86_64 rocm-clang-devel-19-6.rocm6.4.0.el10.x86_64 rocm-clang-libs-19-6.rocm6.4.0.el10.x86_64 rocm-clang-runtime-devel-19-6.rocm6.4.0.el10.x86_64 rocm-cmake-6.4.0-1.el10.noarch rocm-comgr-19-6.rocm6.4.0.el10.x86_64 rocm-comgr-devel-19-6.rocm6.4.0.el10.x86_64 rocm-core-6.4.0-1.el10.x86_64 rocm-core-devel-6.4.0-1.el10.x86_64 rocm-device-libs-19-6.rocm6.4.0.el10.x86_64 rocm-hip-6.4.0-3.el10.x86_64 rocm-hip-devel-6.4.0-3.el10.x86_64 rocm-libc++-19-6.rocm6.4.0.el10.x86_64 rocm-libc++-devel-19-6.rocm6.4.0.el10.x86_64 rocm-lld-19-6.rocm6.4.0.el10.x86_64 rocm-llvm-19-6.rocm6.4.0.el10.x86_64 rocm-llvm-devel-19-6.rocm6.4.0.el10.x86_64 rocm-llvm-filesystem-19-6.rocm6.4.0.el10.x86_64 rocm-llvm-libs-19-6.rocm6.4.0.el10.x86_64 rocm-llvm-static-19-6.rocm6.4.0.el10.x86_64 rocm-rpm-macros-6.4.0-4.limitedisa.el10.noarch rocm-runtime-6.4.0-1.el10.x86_64 rocm-runtime-devel-6.4.0-1.el10.x86_64 rocm-smi-6.4.0-2.el10.x86_64 rocm-smi-devel-6.4.0-2.el10.x86_64 systemtap-sdt-devel-5.2-2.el10.x86_64 systemtap-sdt-dtrace-5.2-2.el10.x86_64 tcl-1:8.6.13-4.el10.x86_64 tpm2-tss-4.1.3-5.el10.x86_64 tzdata-2025b-1.el10.noarch vim-filesystem-2:9.1.083-5.el10.noarch zlib-ng-compat-devel-2.2.3-2.el10.x86_64 Complete! Finish: build setup for rccl-6.4.0-2.el10.src.rpm Start: rpmbuild rccl-6.4.0-2.el10.src.rpm warning: bogus date in %changelog: Tue Apr 23 2025 Tom Rix - 6.4.0-2 Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1745366400 Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.m0LY5F + umask 022 + cd /builddir/build/BUILD + cd /builddir/build/BUILD + rm -rf rccl-rocm-6.4.0 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/RCCL-6.4.0.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd rccl-rocm-6.4.0 + rm -rf /builddir/build/BUILD/rccl-rocm-6.4.0-SPECPARTS + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0-SPECPARTS + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + sed -i -e '/AMD GPU targets to compile for/d' CMakeLists.txt + sed -i -e 's@cat ${ROCM_PATH}/.info/version@echo 6.4.0@' CMakeLists.txt + sed -i -e s@rocm-core/rocm_version.h@rocm_version.h@ src/include/hip_rocm_version_info.h + sed -i -e 's@if (ENABLE_MSCCLPP AND NOT(${HOST_OS_ID} STREQUAL "ubuntu" OR ${HOST_OS_ID} STREQUAL "centos"))@if (ENABLE_MSCCLPP)@' CMakeLists.txt + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.8ZltTi + umask 022 + cd /builddir/build/BUILD + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.4.0 + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + /usr/bin/cmake -S . -B redhat-linux-build -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON '-DAMDGPU_TARGETS=gfx90a:xnack+;gfx90a:xnack-;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201' -DBUILD_FILE_REORG_BACKWARD_COMPATIBILITY=OFF -DBUILD_TESTS=OFF -DCMAKE_BUILD_TYPE=RelWithDebInfo -DCMAKE_C_COMPILER=/usr/bin/hipcc -DCMAKE_CXX_COMPILER=/usr/bin/hipcc -DCMAKE_EXPORT_COMPILE_COMMANDS=OFF -DCMAKE_INSTALL_LIBDIR=/usr/lib64 -DCMAKE_SKIP_RPATH=ON -DENABLE_MSCCLPP=OFF -DHIP_PLATFORM=amd -DRCCL_ROCPROFILER_REGISTER=OFF -DROCM_PATH=/usr -DROCM_SYMLINK_LIBS=OFF -- CMAKE_TOOLCHAIN_FILE: /builddir/build/BUILD/rccl-rocm-6.4.0/toolchain-linux.cmake -- The CXX compiler identification is Clang 19.0.0 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/hipcc - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") CMake Deprecation Warning at /usr/share/rocm/cmake/ROCMConfig.cmake:12 (message): Use of find_package(ROCM) is deprecated as of ROCm 6.4. Please use find_package(ROCmCMakeBuildTools) Call Stack (most recent call first): cmake/Dependencies.cmake:75 (find_package) CMakeLists.txt:55 (include) -- Checking for ROCm support for GPU targets: gfx906;gfx908;gfx90a;gfx942;gfx1030;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201 -- Performing Test COMPILER_HAS_TARGET_ID_gfx906 -- Performing Test COMPILER_HAS_TARGET_ID_gfx906 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx908 -- Performing Test COMPILER_HAS_TARGET_ID_gfx908 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1200 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1200 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1201 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1201 - Success -- Compiling for gfx906;gfx908;gfx90a;gfx942;gfx1030;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201 -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") CMake Deprecation Warning at /usr/share/rocm/cmake/ROCMConfig.cmake:12 (message): Use of find_package(ROCM) is deprecated as of ROCm 6.4. Please use find_package(ROCmCMakeBuildTools) Call Stack (most recent call first): cmake/Dependencies.cmake:75 (find_package) CMakeLists.txt:102 (include) -- ROCM_PATH found: /usr -- Compiling with hipcc -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP compiler: clang -- HIP runtime: rocclr -- hipcc executable: /usr/bin/hipcc sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory -- hipcc version: 6.4.43482 -- hipconfig executable: /usr/bin/hipconfig -- hipcc HIP version: 6.4.43482 -- ROCm version: 6.4.0 -- Looking for hipDeviceMallocUncached -- Looking for hipDeviceMallocUncached - found -- Looking for hipDeviceMallocContiguous -- Looking for hipDeviceMallocContiguous - found -- RCCL LL128 protocol enabled -- HSA runtime: /usr/include -- Found rocm_smi at /usr/include -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found -- RSMI_INIT_FLAG_THRAD_ONLY_MUTEX supported -- Performing Test HAVE_KERNARG_PRELOAD -- Performing Test HAVE_KERNARG_PRELOAD - Success -- Kernarg preloading to SGPR enabled -- Performing Test HAVE_PARALLEL_JOBS -- Performing Test HAVE_PARALLEL_JOBS - Success -- Parallel jobs enabled CMake Warning at CMakeLists.txt:331 (message): ROCTX library not found. Skipping ROCTX linking. -- Found Python3: /usr/bin/python3.12 (found version "3.12.10") found components: Interpreter -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.h -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp -- Generating /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp -- HIP_CONTIGUOUS_MEMORY enabled -- HIP_UNCACHED_MEMORY enabled -- Use 1 jobs for linking -- Building shared RCCL library -- rocm-cmake: Set license file to /builddir/build/BUILD/rccl-rocm-6.4.0/LICENSE.txt. -- Configuring done (25.0s) -- Generating done (0.0s) CMake Warning: Manually-specified variables were not used by the project: AMDGPU_TARGETS CMAKE_CXX_FLAGS_RELEASE CMAKE_C_FLAGS_RELEASE CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build + /usr/bin/cmake --build redhat-linux-build -j4 --verbose Change Dir: '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j4 /usr/bin/cmake -S/builddir/build/BUILD/rccl-rocm-6.4.0 -B/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/CMakeFiles /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build//CMakeFiles/progress.marks /usr/bin/gmake -f CMakeFiles/Makefile2 all gmake[1]: Entering directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' cd /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-rocm-6.4.0 /builddir/build/BUILD/rccl-rocm-6.4.0 /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' [ 0%] Built target git_version_check /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' [ 0%] Hipifying src/channel.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc [ 0%] Hipifying src/transport/shm.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc [ 0%] Hipifying src/collectives.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc [ 0%] Hipifying src/bootstrap.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport/shm.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/channel.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/bootstrap.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/collectives.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc [ 1%] Hipifying src/debug.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/debug.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc [ 1%] Hipifying src/device/all_gather.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/all_gather.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h [ 1%] Hipifying src/device/all_reduce.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/all_reduce.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h [ 2%] Hipifying src/device/alltoall_pivot.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/alltoall_pivot.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h [ 2%] Hipifying src/device/broadcast.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/broadcast.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h [ 2%] Hipifying src/device/common.cu -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/common.cu -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h [ 2%] Hipifying src/device/common.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/common.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h [ 2%] Hipifying src/device/common_kernel.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common_kernel.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/common_kernel.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common_kernel.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h [ 2%] Hipifying src/device/msccl_kernel_impl.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/msccl_kernel_impl.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h [ 3%] Hipifying src/device/network/unpack/unpack.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack/unpack.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/network/unpack/unpack.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack/unpack.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack/unpack.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h [ 3%] Hipifying src/device/network/unpack/unpack_defs.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/network/unpack/unpack_defs.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common_kernel.h [ 3%] Hipifying src/device/onerank.cu -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/onerank.cu -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h [ 4%] Hipifying src/device/op128.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/op128.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/op128.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/op128.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/op128.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack/unpack.h [ 4%] Hipifying src/device/primitives.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/primitives.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h [ 4%] Hipifying src/device/prims_ll.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/prims_ll.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h [ 4%] Hipifying src/device/prims_ll128.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/prims_ll128.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/op128.h [ 5%] Hipifying src/device/prims_simple.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/prims_simple.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h [ 5%] Hipifying src/device/reduce.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/reduce.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h [ 5%] Hipifying src/device/reduce_kernel.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_kernel.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/reduce_kernel.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_kernel.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h [ 5%] Hipifying src/device/reduce_scatter.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/reduce_scatter.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h [ 6%] Hipifying src/device/sendrecv.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/device/sendrecv.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h [ 6%] Hipifying src/enqueue.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/enqueue.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h [ 6%] Hipifying src/graph/connect.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/connect.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_kernel.h [ 6%] Hipifying src/graph/paths.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/paths.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h [ 6%] Hipifying src/graph/rings.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/rings.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc [ 6%] Hipifying src/graph/rome_models.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc [ 7%] Hipifying src/graph/rings.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/rome_models.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/rings.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.h [ 7%] Hipifying src/graph/rome_models.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/rome_models.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.h [ 7%] Hipifying src/graph/search.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/search.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc [ 8%] Hipifying src/graph/topo.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/topo.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc [ 8%] Hipifying src/graph/topo.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/topo.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h [ 8%] Hipifying src/graph/trees.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/trees.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/trees.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/trees.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/trees.cc [ 8%] Hipifying src/graph/tuning.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/tuning.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc [ 9%] Hipifying src/graph/xml.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/xml.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc [ 9%] Hipifying src/graph/xml.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h [ 9%] Hipifying src/group.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/graph/xml.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/group.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc [ 9%] Hipifying src/include/BfdBacktrace.hpp -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/BfdBacktrace.hpp -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp [ 9%] Hipifying src/include/alloc.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/alloc.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h [ 9%] Hipifying src/include/alt_rsmi.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alt_rsmi.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/alt_rsmi.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alt_rsmi.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alt_rsmi.h [ 9%] Hipifying src/include/api_trace.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/api_trace.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/api_trace.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/api_trace.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/api_trace.h [ 10%] Hipifying src/include/archinfo.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/archinfo.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/archinfo.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/archinfo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/archinfo.h [ 10%] Hipifying src/include/argcheck.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/argcheck.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h [ 11%] Hipifying src/include/bitops.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bitops.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/bitops.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bitops.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bitops.h [ 11%] Hipifying src/include/bootstrap.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/bootstrap.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h [ 11%] Hipifying src/include/channel.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/channel.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h [ 11%] Hipifying src/include/checks.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/checks.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/checks.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/checks.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/checks.h [ 11%] Hipifying src/include/coll_net.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/coll_net.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h [ 12%] Hipifying src/include/collectives.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/collectives.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h [ 12%] Hipifying src/include/comm.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/comm.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h [ 12%] Hipifying src/include/core.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/core.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h [ 13%] Hipifying src/include/cpuset.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/cpuset.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/cpuset.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/cpuset.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/cpuset.h [ 13%] Hipifying src/include/debug.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/debug.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/debug.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/debug.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/debug.h [ 13%] Hipifying src/include/device.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/device.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h [ 13%] Hipifying src/include/enqueue.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/enqueue.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h [ 14%] Hipifying src/include/gdrwrap.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/gdrwrap.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h [ 14%] Hipifying src/include/git_version.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/git_version.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/git_version.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/git_version.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/git_version.h [ 14%] Hipifying src/include/graph.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/graph.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h [ 14%] Hipifying src/include/group.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/group.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h [ 15%] Hipifying src/include/hip_rocm_version_info.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/hip_rocm_version_info.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h [ 15%] Hipifying src/include/ibvcore.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvcore.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/ibvcore.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvcore.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvcore.h [ 15%] Hipifying src/include/ibvsymbols.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvsymbols.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/ibvsymbols.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvsymbols.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvsymbols.h [ 15%] Hipifying src/include/ibvwrap.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/ibvwrap.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h [ 16%] Hipifying src/include/info.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/info.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h [ 16%] Hipifying src/include/ipcsocket.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ipcsocket.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/ipcsocket.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ipcsocket.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ipcsocket.h [ 17%] Hipifying src/include/msccl/msccl_kernel.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/msccl/msccl_kernel.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h [ 18%] Hipifying src/include/msccl/msccl_lifecycle.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/msccl/msccl_lifecycle.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h [ 18%] Hipifying src/include/msccl/msccl_parser.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/msccl/msccl_parser.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h [ 18%] Hipifying src/include/msccl/msccl_scheduler.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/msccl/msccl_scheduler.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h [ 18%] Hipifying src/include/msccl/msccl_setup.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/msccl/msccl_setup.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h [ 19%] Hipifying src/include/msccl/msccl_status.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/msccl/msccl_status.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h [ 19%] Hipifying src/include/msccl/msccl_struct.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/msccl/msccl_struct.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h [ 19%] Hipifying src/include/nccl_common.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_common.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nccl_common.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_common.h [ 19%] Hipifying src/include/nccl_net.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_net.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nccl_net.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_net.h [ 20%] Hipifying src/include/nccl_tuner.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_tuner.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nccl_tuner.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_tuner.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nccl_tuner.h [ 20%] Hipifying src/include/net.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/net.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h [ 20%] Hipifying src/include/net_device.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net_device.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/net_device.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net_device.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net_device.h [ 20%] Hipifying src/include/npkit/npkit.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/npkit/npkit.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h [ 20%] Hipifying src/include/npkit/npkit_event.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit_event.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/npkit/npkit_event.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit_event.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit_event.h [ 21%] Hipifying src/include/npkit/npkit_struct.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/npkit/npkit_struct.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h [ 21%] Hipifying src/include/nvmlwrap.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvmlwrap.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvmlwrap.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvmlwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvmlwrap.h [ 22%] Hipifying src/include/nvtx.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h [ 22%] Hipifying src/include/nvtx3/nvToolsExt.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExt.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCounters.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtCounters.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtCuda.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtCudaRt.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtMem.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtMem.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtMemCudaRt.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtMemCudaRt.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtOpenCL.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtPayload.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtPayloadHelper.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtPayloadHelper.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtSemanticsCounters.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtSemanticsCounters.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtSemanticsScope.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtSemanticsScope.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h [ 24%] Hipifying src/include/nvtx3/nvtx3.hpp -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp [ 25%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvToolsExtSync.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtx3.hpp -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImpl.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtInit.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtInit.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtTypes.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h [ 30%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h [ 30%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h [ 30%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h [ 30%] Hipifying src/include/nvtx_stub.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx_stub.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/nvtx_stub.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx_stub.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx_stub.h [ 30%] Hipifying src/include/p2p.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/p2p.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h [ 30%] Hipifying src/include/param.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/param.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/param.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/param.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/param.h [ 30%] Hipifying src/include/profiler.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/profiler.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h [ 31%] Hipifying src/include/proxy.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/proxy.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h [ 31%] Hipifying src/include/rccl_float8.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/rccl_float8.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h [ 31%] Hipifying src/include/rccl_vars.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_vars.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/rccl_vars.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_vars.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_vars.h [ 31%] Hipifying src/include/register.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/register.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/register.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/register.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/register.h [ 32%] Hipifying src/include/rocm_smi_wrap.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/rocm_smi_wrap.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h [ 32%] Hipifying src/include/rocmwrap.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rocmwrap.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/rocmwrap.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rocmwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rocmwrap.h [ 32%] Hipifying src/include/shm.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/shm.h [ 32%] Hipifying src/include/roctx.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/roctx.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/shm.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/shm.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/shm.h [ 33%] Hipifying src/include/signals.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/signals.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/signals.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/signals.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/signals.h [ 33%] Hipifying src/include/socket.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/socket.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/socket.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/socket.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/socket.h [ 33%] Hipifying src/include/strongstream.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/strongstream.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/strongstream.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/strongstream.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/strongstream.h [ 33%] Hipifying src/include/timer.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/timer.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/timer.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/timer.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/timer.h [ 34%] Hipifying src/include/transport.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/transport.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/transport.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/transport.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/transport.h [ 34%] Hipifying src/include/trees.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/trees.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/trees.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/trees.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/trees.h [ 34%] Hipifying src/include/tuner.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/tuner.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h [ 34%] Hipifying src/include/utils.h -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/include/utils.h -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h [ 34%] Hipifying src/init.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/init.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc [ 35%] Hipifying src/init_nvtx.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/init_nvtx.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc [ 35%] Hipifying src/misc/alt_rsmi.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/alt_rsmi.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc [ 35%] Hipifying src/misc/api_trace.c -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.c mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/api_trace.c -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.c && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.c [ 35%] Hipifying src/misc/api_trace.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/api_trace.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc [ 36%] Hipifying src/misc/archinfo.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/archinfo.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/archinfo.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/archinfo.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/archinfo.cc [ 36%] Hipifying src/misc/argcheck.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/argcheck.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc [ 37%] Hipifying src/misc/ibvsymbols.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/ibvsymbols.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc [ 37%] Hipifying src/misc/ibvwrap.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/ibvwrap.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc [ 37%] Hipifying src/misc/ipcsocket.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/ipcsocket.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc [ 37%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/msccl/msccl_lifecycle.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc [ 38%] Hipifying src/misc/msccl/msccl_parser.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/msccl/msccl_parser.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc [ 38%] Hipifying src/misc/msccl/msccl_setup.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/msccl/msccl_setup.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc [ 38%] Hipifying src/misc/msccl/msccl_status.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/msccl/msccl_status.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc [ 38%] Hipifying src/misc/npkit.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/npkit.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc [ 39%] Hipifying src/misc/nvmlwrap_stub.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/nvmlwrap_stub.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc [ 39%] Hipifying src/misc/param.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/param.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/param.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/param.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/param.cc [ 39%] Hipifying src/misc/profiler.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/profiler.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc [ 39%] Hipifying src/misc/rocm_smi_wrap.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/rocm_smi_wrap.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc [ 40%] Hipifying src/misc/rocmwrap.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocmwrap.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/rocmwrap.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocmwrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocmwrap.cc [ 40%] Hipifying src/misc/roctx.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/roctx.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc [ 40%] Hipifying src/misc/shmutils.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/shmutils.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc [ 40%] Hipifying src/misc/signals.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/signals.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/signals.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/signals.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/signals.cc [ 41%] Hipifying src/misc/socket.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/socket.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc [ 41%] Hipifying src/misc/strongstream.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/strongstream.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/strongstream.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/strongstream.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/strongstream.cc [ 41%] Hipifying src/misc/tuner.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/tuner.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc [ 41%] Hipifying src/misc/utils.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/misc/utils.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc [ 41%] Hipifying src/msccl.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/msccl.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc [ 41%] Hipifying src/net.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/net.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc [ 41%] Hipifying src/proxy.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc [ 42%] Hipifying src/register.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/proxy.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/register.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc [ 42%] Hipifying src/transport.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc [ 42%] Hipifying src/transport/coll_net.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport/coll_net.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc [ 43%] Hipifying src/transport/generic.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport/generic.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc [ 43%] Hipifying src/transport/net_ib.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport/net_ib.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc [ 43%] Hipifying src/transport/net_socket.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport/net_socket.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc [ 43%] Hipifying src/transport/net.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport/net.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc [ 44%] Hipifying src/transport/nvls.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport/nvls.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc [ 44%] Hipifying src/transport/p2p.cc -> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc mkdir -p /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-rocm-6.4.0/src/transport/p2p.cc -o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-rocm-6.4.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc cd /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-rocm-6.4.0 /builddir/build/BUILD/rccl-rocm-6.4.0 /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/debug.cc.o [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/channel.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, m/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ antissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParams: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ AllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] | ^~~~~ 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype};In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/bootstrap.cc| : ^~~~~~~9 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{rec/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ vcount * ncclTypeSize(dat/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ atype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ :461:22: warning: unused variable 'payload' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ :378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc :161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllG1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 1 warning generated when compiling for gfx1200. 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, atherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static n1 warning generated when compiling for gfx90a. cclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ 1 warning generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ 307 | N/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.ccvtxParamsGather paylo:a412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] d412 | const{expr nvtxsPayloadScehemaEntry_tn ScatterSchdema[] = { c | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418o:23: warning: unused variable 'payload' [-Wunused-variable] 418u | NvtxPntaramsScatt er payloa*d{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ nccl/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.ccT:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ ypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: :warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxP461aramsRedu:ce payl22oad{coun:t * nccl TypeSize(dwarning: atatype),unused variable 'payload' [-Wunused-variable] roo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ t, op, datatyp e}; | ^~~~~~~ 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ :378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSche/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ ma[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recv/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTcount * ncclTypeSize(datatype), op, datatype/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ }; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ ypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc] :=378: 38{ : | warning: ^~~~~~~~~~~~~~~~~~~ unused variable 'ReduceScatterSchema' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 378 | constexpr 385n | v t xNPvatyxlPoaardaSmcshReemdauEcnetSrcya_ttt eRre dpuacyelSocaadt{treercSvcchoeumnat[ ]* =n c{c l T| y ^~~~~~~~~~~~~~~~~~~p eSize(d/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cca:t385a:t27y:p ewarning: )unused variable 'payload' [-Wunused-variable], op, datatype }385; | | N ^~~~~~~v txParamsReduceScatter payload{recvcount * ncclTypeSize(datatyp:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(de), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSatatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ :461:22: warning: unused variable 'payload' [-Wunused-variable] ize(datatype), op, datatype}; | ^~~~~~~ 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ (datatype), peer, dataty/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[compe}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ m->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ 1/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ warning generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:166:22:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllT/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ oAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ 1 warning generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ : warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ 1 warning generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1102. 31 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx90a. 31 warnings generated when compiling for gfx1100. 31 warnings generated when compiling for gfx1201. 31 warnings generated when compiling for gfx1030. 31 warnings generated when compiling for gfx90a. 31 warnings generated when compiling for gfx1200. 31 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 31 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 31 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 31 warnings generated when compiling for gfx908. 31 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx90a. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc 8 warnings generated when compiling for gfx1200. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx1201. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx942. 8 warnings generated when compiling for host. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: 77 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t inIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ fo; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc2/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ warnings generated when compiling for gfx908. :72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1201. 2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42::4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable]NvtxEnumRedSchema[ 4 | s] = { | ^~~~~~~~~~~~~~~~~ tatic constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int p2 warnings generated when compiling for gfx906. ow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* col22 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. warnings generated when compiling for gfx908. lComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ 35 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/msccl.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ m* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/enqueue.cc:1063:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1063 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ 35 warnings generated when compiling for gfx1200. 35 warnings generated when compiling for gfx90a. 35 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ 35 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | conste35 warnings generated when compiling for gfx1102. xpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ 35 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ 35 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResul35 warnings generated when compiling for gfx942. t_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int6/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ 4_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDevi/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.ccceProp_t devProp; | ^~~~~~~ :1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ 35 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ :2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1847:11: warning: unused variable 'stackSize' [-Wunused-variable] 1847 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:1848:19: warning: unused variable 'devProp' [-Wunused-variable] 1848 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2254:26: warning: unused variable 'payload' [-Wunused-variable] 2254 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ , nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2268:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2268 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ 57 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2553:26: warning: unused variable 'payload' [-Wunused-variable] 2553 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2588:26: warning: unused variable 'payload' [-Wunused-variable] 2588 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 57 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 57 warnings generated when compiling for gfx942. 57 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:18: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:85:21: warning: unused function 'commReclaim' [-Wunused-function] 85 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/init.cc:2239:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2239 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 57 warnings generated when compiling for gfx1101. 57 warnings generated when compiling for gfx1200. 57 warnings generated when compiling for gfx906. 57 warnings generated when compiling for gfx90a. 57 warnings generated when compiling for gfx1201. 57 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.hIn file included from :15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ 2 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 77 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx90a. 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1201. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/register.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/register.cc.o -MF CMakeFiles/rccl.dir/hipify/src/register.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/register.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc 35 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc 57 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx1200. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 3/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 3In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1102. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx906. warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc 3 warnings generated when compiling for host. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | i/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.ccnt x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int loca:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ lRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ :275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ :462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.ccIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h::44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t: xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode*In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.ccid, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ :275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: 2 warnings generated when compiling for host. warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* noIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ de) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx906. 31 warnings generated when compiling for gfx1200. 13 warnings generated when compiling for gfx1201. 31 warnings generated when compiling for gfx1030. 31 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1100. 31 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1101. 31 warnings generated when compiling for gfx908. 31 warnings generated when compiling for gfx1101. 31 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1200. 31 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx942. 3113 warnings generated when compiling for gfx908. 31 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 31 warnings generated when compiling for host. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc 13 warnings generated when compiling for host. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warning generated when compiling for gfx1101. warning generated when compiling for gfx90a. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx906. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ :1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858::72036:7: warning: unused variable 'ncpus' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ : 2036 | int ncpus = system->nwarning: odes[CPU]variable 'gcnt' set but not used [-Wunused-but-set-variable].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ 1858 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ _sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tv/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ e.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248clResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. 38 warnings generated when compiling for gfx1102. 38 warnings generated when compiling for gfx942. 3838 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx90a. 38 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 38 warnings generated when compiling for gfx906. 38 warnings generated when compiling for gfx908. 38 warnings generated when compiling for gfx1030. 38 warnings generated when compiling for gfx1100. 38 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx1201. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc 2 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.hIn file included from :77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1201. 38 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/trees.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ * collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** reIn file included from quest) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char2 warnings generated when compiling for gfx1100. * attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** nod/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ e) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc 30 warnings generated when compiling for gfx1200. 30 warnings generated when compiling for gfx1101. 30 warnings generated when compiling for gfx906. 30 warnings generated when compiling for gfx942. 30 warnings generated when compiling for gfx908. 30 warnings generated when compiling for gfx1100. 30 warnings generated when compiling for gfx1201. 30 warnings generated when compiling for gfx1030. 30 warnings generated when compiling for gfx90a. 30 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc 30 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/archinfo.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclT/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ opoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1200. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1201. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const chIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: ar* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 1010 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ :105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ :233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.hIn file included from :14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc6 warnings generated when compiling for gfx1201. :52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 6 warnings generated when compiling for gfx1102. 6 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 6 warnings generated when compiling for gfx908. 6 warnings generated when compiling for gfx1101. 6 warnings generated when compiling for gfx1200. 6 warnings generated when compiling for gfx1100. 6 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1101. 6 warnings generated when compiling for gfx90a. 6 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for host. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1100. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1200. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1102. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 1 warning generated when compiling for gfx1030. 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 11 warning generated when compiling for gfx906. warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 11 warning generated when compiling for host. warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx942. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 6 warnings generated when compiling for host. 11 warning generated when compiling for gfx908. warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/param.cc [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx908. warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for host. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocmwrap.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx906. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/signals.cc [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx942. 12 warnings generated when compiling for gfx1100. warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1101. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/strongstream.cc 2 warnings generated when compiling for gfx1200. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ :602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for host. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1100. 22 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx906. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1200. 11 warning generated when compiling for gfx1101. warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ :517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ :712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.ccIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ :724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1201. 4 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 4 warnings generated 75 when compiling for | static ncclResult_t msccgfx1200lXmlGet. AttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 4In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 15 warnings generated when compiling for gfx1201. 15 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx942. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1200. 15 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1201. 3 warnings generated when compiling for gfx1200. 3 warnings generated when compiling for gfx942. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for host. 3 warnings generated when compiling for gfx90a. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx906. 11 warning generated when compiling for gfx1201. warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1030. 3 warnings generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc 1 warning generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_memIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ _desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14 44 | s: tatic long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSucc185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.hess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ :219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, manIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | tstatic long log2iis(long nsa;) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** lis | ^ tenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx1102. 222 warnings generated warnings generated when compiling for gfx1030. when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 22 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx1101. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc 22 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx1200. 22 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 16 warnings generated when compiling for gfx942. 16 warnings generated when compiling for gfx1100. 16 warnings generated when compiling for gfx1101. 16 warnings generated when compiling for gfx906. 16 warnings generated when compiling for gfx90a. 16 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 16 warnings generated when compiling for gfx908. 16 warnings generated when compiling for gfx1030. 16 warnings generated when compiling for gfx1200. 16 warnings generated when compiling for gfx1201. 24 warnings generated when compiling for gfx942. 24 warnings generated when compiling for gfx1100. 24 warnings generated when compiling for gfx1030. 24 warnings generated when compiling for gfx1101. 24 warnings generated when compiling for gfx90a. 24 warnings generated when compiling for gfx1201. 24 warnings generated when compiling for gfx906. 24 warnings generated when compiling for gfx1102. 24 warnings generated when compiling for gfx908. 24 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc8:8: : In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h14: :In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:1415: : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:In file included from 14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 22 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1200. 2 warning2s generated when compiling for gfx1201. warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc 16 warnings generated when compiling for host. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc 24 warnings generated when compiling for host. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function]In file included from 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uinIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ t32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx908. In file included from 2 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.ccIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx942. warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for host. 2 warnings generated when compiling for host. In file included from [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid =In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, daIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | coIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = 145 | threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175n: st/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h :271i:19nt: w =warning: unused variable 'ptr' [-Wunused-variable]thr e a271 | dId x. x / W A RPu_iSnItZ6E4;_ t\* | ^p tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xr = recvPtr(0)+ll128Offset; | ^~~ /WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:171:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 171 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllGather_RING_LL128_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hn:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ ; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sis(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:In file included from 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from (threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadsIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDeIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ vFunc(AllReduce_RING_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t da/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TIn file included from REE_S/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cppIM:P2L: EIn file included from _/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hM:i11n: MIn file included from a/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hx:_175b: f1/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h6:_2714:,19 :n cwarning: clunused variable 'ptr' [-Wunused-variable]F uncAllReduce, FuncMinMax, hip _271b | f l uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ oat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ RP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tiIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 63 | Primitives, 0, Proto, 0> prims | ^ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75In file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 173/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 671 | stepSize(stepSize_ 27 == 0 ? ncclShme | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128O/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hffset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from lag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 27 | c/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:onst int bid = ncc2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] lShmem.channelId 145 | uint32 - work->channelL_t daota1,; | ^~~fla In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, flag2In file included from ; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | steIn file included from p/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ Size(stepIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Size_ == 0 ? ncclIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPSIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:ZE; \ | ^ 2/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h: ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h175:145:28: warning: : unused variable 'data2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h uint32_t dat:a1, flag1,271 data2, flag2:; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h19:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrieIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tidIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ , nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' work->redOp A670r | g ) ; t| i ^d (tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hx):,1070 :g5r:o unote: p(in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested hereg roup), | 1070 | runTreeSplit(tid, nthreads, work); | ^ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432field 'group' will be initialized after field 'stepSize': 78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 670 | t432i | d ( t i d ) ,i fn t(htrieda d().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm7em | .commD.EbFuIfNfES_inzcecsl[DNeCvCFLu_nPcR(OATlOl_RSeIdMuPcLeE_]T/RNECEC_LS_ISMTPELPES_/MsiinzMeaoxf_(bTf)8 _:2 ,s tnecpcSliFzuen_c)A l{l R e| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u c e| , group(group FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPL/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hE:,254 :290): note: | in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here^ 254 | Primi/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.ht:i611v:e62s:< Tnote: ,expanded from macro 'DEFINE_ncclDevFunc' RedOp, FanAsymme t611r | i c < N CRCuLn_WMoArXk_BDaEtVc_hAy,, /r*eDdiorpe*,/ 0a,l gPor,o tpor,o t0o>, purnirmosl l >| ( ^) .run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h | : 670 : 15 :r unote: nfield 'nthreads' will be initialized after field 'tidInBlock'T reeUpDowns,) ,C OtLiLd_IUnNBRlOoLcLk>((tthirde,a dnItdhxr.exa)d,s ,g rwoourpk()g;r o u| p ^) , | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here670 | tid(tid), 432n | t h r e a d si(fn t(htride a().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().rIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] initializer order does not match the declaration order [-Wreorder-ctor] 670 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h :63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Size(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) RedOp, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkC/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cppnote: :12:in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here1 : note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DE F254I | N E _ n c c lPDreivmFiutnicv(eAsle,d u/c*eD,i rFeucntc=M*i/n0M,a xP,r ortcoc,l _0b>f lporaitm8s, N| C ^C L_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h :note: 611in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here: 62: note: expanded from macro 'DEFINE_ncclDevFunc' 565 | 611 | r u n T rReuenUWpoDrokwBna1,, a1l,g oC,O LpLr_oUtNoR,O LuLn>r,o lClO>L(L)_.UrNuRnO(L)L;> (\t i d| , ^ nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hi:d432):,78 :n tnote: hin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herer eads(nthreads) ,432 | t i d I n B liofc k((ttihdr e (t)i.dr(utni(dt)i,d ,n tshurbetand,s (wnotrhkr)e;a d s| ^) , tidInBlock(threadIdx/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp.:x17):,1 :g rnote: oin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereu p(group), | ^~~~~~~~~~~ 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run();/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h: \670: 15:| ^ warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ti d670( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u p| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: 671note: | field 'group' will be initialized after field 'stepSize' stepSiz e670( | s t e p Stiizde(_t i=d=) ,0 n?t hnrcecaldSsh(mnetmh.rceoamdms.)b,u ftfiSdiIzneBsl[oNcCkC(Lt_hPrReOaTdOI_dSxI.MxP)L,E ]g/rNoCuCpL(_gSrToEuPpS)/,s i z| e ^~~~~~~~~~~o f(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSize:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tids[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_U), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthrea/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(ds, work); | s ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:t432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here e432 | if p(tid < subtn)S RunWorkCoill().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : ste pSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCLIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hinitializer order does not match the declaration order [-Wreorder-ctor]: 303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitiv 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx1102. 1212 warnings generated when compiling for gfx908. warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h271 | : 75 : 7 : warning: unused variable 'w' [-Wunused-variable] uint64_t* ptr = recvPtr(0)+ l75l | 1 28 O f f sbeatr;r i e| r ^~~_ by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();In file included from | ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1In file included from ,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27In file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h tid:I670n:B15l:o warning: cinitializer order does not match the declaration order [-Wreorder-ctor]k (threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ . 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ buffSizes[NCCL_PROTO_SIMPLE]/NCCL/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' _S TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> p:rims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nnthrethrads, woeads(rk); | nth ^ reads), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:tidInBlock432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here (threadIdx.x), group(gro 432 | up), if (tid | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ < 671 | subtn) RunWor stkColl().run(tid, subtn, work); | ^ epSize(stepSize_ == 0 ? ncclShmem.com/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cppm.buffSize:22:1: note: s[Nin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here CCL_PROTO_SIMPLE] /NCCL22 | DEF_STEPS/sizeofINE_ncclDe(T) vFunc(: stepSizAllRede_) uce_R{ ING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | , ty, Primit redoives FanA, algo, psymmeroto, untriroll>().run();c< \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid1, NCCL_MAX_D),EV_A nthRITY>,read /s(nthrea*Dids), tidInBlrock(thect=*/0, Proto, 0> prims readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(t | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorikdC)o,l lng(r)ou.pr(ugnr(otuipd),, s u| b ^~~~~~~~~~~t n, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. 22 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 18 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 18 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ Id - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | constIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreIn file included from a/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ op, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ > prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMin15:Max, doubl warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ e, NCCL_ALGO_RING, NCCL_PRO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBhlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h| : ^~~~~~~~~~~~~~~~~670 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid (670t | id ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from a/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OffseIn file included from t; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ E; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from 271 | uint6/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from 4/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: _warning: unused variable 'data1' [-Wunused-variable] 145 | t uint* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2In file included from : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmemIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZ218:E15: warning: unused variable 'bid' [-Wunused-variable] ;218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hPROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ kBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h670 | : 670 : 15 :t iwarning: d(initializer order does not match the declaration order [-Wreorder-ctor]t id), n 670 | t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ( g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_o up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSiz e671( | s t e p Ssitzeep_S i=z=e (0s t?e pnSciczleS_h m=e=m .0c o?m mn.cbculfSfhSmiezme.sc[oNmCmC.Lb_uPfRfOSTiOz_eSsI[MNPCLCEL]_/PNRCOCTLO__SSTIEMPPSL/Es]i/zNeCoCfL(_TS)T E:P Ss/tseipzSeiozfe(_T)) {: s| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e p S| i group(groupz e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(t671id), nthr | eads(nthrea ds), tidInBlock(thread Idx.x) , g roup(sgroup), t| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ e 671 | stpepSize(sStepSizei_ =z= 0 ? e(stepSize_ == 0 ? ncclSnchclShmem.cmomm.buffSiezes[NCCmL.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | fl/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ agThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, wor:k421):; 9 :| ^ note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 421 | 7 | D EpFrIiNEm_sn(ctcildD,e vnFtuhnrc(eAaldlsRe,d utcree_TeR-E>Ed_SoIwMnP,L Et_rMeineM-a>x_duo3w2n_,2 , wnocrckl-F>usnecnAdlbluRfefd,uc ew,o Frukn-c>MriencMvabxu,f fui,n tw3o2r_kt-,> rNeCdCOLp_AArLgG)O_;T R E| E ^, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:: 611note: :in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here62 : note: expanded from macro 'DEFINE_ncclDevFunc' 1070 | 611 | r un Tr eeRSpunlWiortk_,U NaROlLgLo,> (ptird,o ntot,hr euandrso,l l>wo(r)k.);r u| ^n (); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthrea d432s | ) , t i d IniBfl o(ctki(dth r().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, un/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from 670 | ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:d2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h(:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:t175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:i29: dwarning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] ) ,506 | tid(tidn), nthtreads(hnthrerads), weid(atid%WdARP_SIZs(En),threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMa warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ x_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primiti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ves, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROL 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShme/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hm.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nt/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(Threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] : 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670670 | | tid(tid) , tid(tid), nthreads(nthreads), tidIn nthrBeads(nthrelads), tidInBock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | run:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hdIn:B670l:o15:c kwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 670/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h | : 670 : 60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hd:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] s(nthreads), tidInBlock(threa d670I | d x . xt)i,d (gtriodu)p,( gnrtohurpe)a,d s( n| t ^~~~~~~~~~~h reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? n| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ cclShmem.comm.buffSi671zes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | stepS/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Pro/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hto:,670 :015>: pwarning: initializer order does not match the declaration order [-Wreorder-ctor] r 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx908. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp 22 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 18 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h18 warnings generated when compiling for gfx1100. :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670Idx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.heof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ntIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work);/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().rutepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmen(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254::90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nth_rMinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barriedx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | bIn file included from arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.mem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Procomm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(:threadIdx670.x),: group15(group), :| ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h18 warnings generated when compiling for host. :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 22 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclS/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hhmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AltlRherdeuacdes_)T,R EwEi_SdI(MtPiLEd_%PWreAMRuPl_SSuIm_ZbEf)8,_ 2w,a rnpc(ctliFdu/nWcAARlPl_RSeIdZuEc)e,, F| u ~~~~~~~~~~~~~~~~~~n c P| r stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)e MulSum, rccl_bfloat8, N507CC | L _ A L GwOa_rTpRIEEn,B NlCoCcLk(_tPhRrOeTaOd_SIIdMxP.LxE/,W A2R)P _ S| ^ IZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h : 611| : warp(tid/WARP_SIZE62 : note: expanded from macro 'DEFINE_ncclDevFunc' 508 | 611 | f l a g TRhurneWaodr(k(Btaitdc%h4<)c=o=l3l),, tgyr,o urpe(dgorpo,, a| l ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~g o ,| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3p roto, unroll>() .r509u | n () ; s\t e p| ^S ize(ncclShmem./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hc:o670:m15m:. bnote: ufield 'nthreads' will be initialized after field 'tidInBlock'f fSizes[NCCL_PRO T670O | _ L L 1 2ti8d](/tNidC)C, Ln_tShTrEePaSd/s(snitzhreeoadfs()u,i ntti6d4I_ntB)lo)c k{( t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| I group(groupd x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h : 503t:i9d:( tinote: din instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here), nthreads(nthreads) ,503 | t i d I n B l o cpkr(itmhsr(etaiddId-xn.txh)r,e agdrsoSuppl(igtr,o unpt)h,r e a| d ^~~~~~~~~~~s -nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), nAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. 22 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 18 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Ptr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t datIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = nc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ clShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, n/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ht:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | stepS ize(step Size_ P== 0 ? rimitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nt/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, uhreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hstepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, su/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here bt n63, | w o r kP)r;i m i| t ^i ves, 0, Proto,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp :022>: 1p:r inote: min instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested heres | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h22: | 558D:E5F:I Nnote: Ein instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here_ ncclDevFun c558( | A l l R erduuncRei_nRgIc(ltFiudn,c AnltlhRreedaudcse,, wFournkc)P;r e M| u ^l Sum, half, NCCL_ALGO_R/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hI:N432G:,78 :N Cnote: Cin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereL _PROTO_SIMPLE ,432 | 4 ) | ^ if (tid < subt/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hn:)611 :R62u:n Wnote: oexpanded from macro 'DEFINE_ncclDevFunc'r kCollo(p)<.tryu>n,( taildg,o ,s upbrtont,o ,w ournkr)o;l l >| ( ^) .run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 22 | DEFINE_ncclD e670v | F u n c (tiAdl(ltRiedd)u,c en_tRhIrNeG_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllRe/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ duce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tistepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.htid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const in t29 | co nst int w = twhreadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - woIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ rk->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBl:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(noctk(threadIhdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(All/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ = 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp 22 warnings generated when compiling for gfx90a. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncnote: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ clShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp: 229: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h | : 27 : 15c:o nwarning: stunused variable 'bid' [-Wunused-variable] int w = threadIdx.x/WARP _27S | IZ E ; \c o n| s ^t int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hnote: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | consIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ t int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ em.channelId - work->channelLo; | ^~~ ARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreadsIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tidx)),, ngtrhoruepa(dgsr(onutph)r,e ad s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Lo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 18 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channe/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hl:I218d: 15-: wwarning: ounused variable 'bid' [-Wunused-variable] rk->chan n218e | l Lo ; c o| n ^~~ st int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x18In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | In file included from ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WAIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ RP _671S | I Z E ) ,s t e| p ~~~~~~~~~~~~~~~~~~S i z| e stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)( stepSize_ == 0 ? ncclShmem .c507o | m m . b uwfafrSpiIzneBsl[oNcCkC(Lt_hPrReOaTdOI_dSxI.MxP/LWEA]R/PN_CSCILZ_ES)T,E P S| / ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s i z| e warp(tid/WARP_SIZEo f(T) : step S508i | z e _ ) f{l a g| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd ((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepS/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hi:z254e:(90n:c cnote: lin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereS hmem.comm.buffSi z254e | s [ N C C L _PPrRiOmTiOt_iLvLe1s2<8T],/ NRCeCdLO_pS,T EFPaSn/Assiyzmemoeft(ruiicn, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h9::565 :note: 5:in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565503 | | r u n T rpereiUmpsD(otwind<-Tn,t hRreedaOdps,S pPlriott,o Snitmhprleea&,t rCeOeL-L>_uUpN,R OtLrLe>e(-t>iddo,w nn,t hwroerakd-s>,s ewnodrbku)f;f , | w ^o rk->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 432 | 1070 | i f ( triudn Tr(ottiod,, CnOtLhLr_eUaNdRsO,L Lw>o(r)k.)r;u n (| t ^i d, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cppnote: :in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here7 :1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 432 | i f7 | (DtEiFdI Nn(c)A.lrluRne(dtuicde,, sFuubntcnP,r ewMourlkS)u;m , | u ^i nt32_t, NCCL_ALG/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cppO:_5T:R1E:E ,note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested hereN CCL_PROTO_SIMPLE ,5 | 2D)E F I| N^E _ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611: RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h: :670/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:15::670 :note: 15:field 'nthreads' will be initialized after field 'tidInBlock' warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo ck670( | t h r e atdiIdd(xt.ixd)),, gnrtohurpe(agdrso(unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s ), tidIn/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hB:l670o:c60k:( note: tfield 'group' will be initialized after field 'stepSize'h readIdx.x), g r670o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_r eads(nthreads), ti d671I | n Bl o c ks(ttehprSeiazdeI(dsxt.exp)S,i zger_o u=p=( g0r o? unpc)c,l S h| ^~~~~~~~~~~m em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(ntchreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.k(comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/stizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group hre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TRE.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(sEtepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, ), | ^~~~~~~~~~~ uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :15: warning: 670 | tid(tid), nthreads(ntinitializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h75:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: | unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const intIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 145 | uint32_t data1, flag1, data2/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h,:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ , flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelIdIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const in - work->channelLo; | ^~~ t w = thread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t daIn file included from ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ id = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx90a. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 18 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32In file included from _t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp :d2a: tIn file included from a/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h1:,11 : fIn file included from la/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hg:1174,: d/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hata:275,: 7f:l awarning: g2unused variable 'w' [-Wunused-variable]; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 18In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 18 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo;In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp :| 2: ^~~In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h18 warning:s218 generated: when compiling for 15gfx942:. warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tix), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cppIn file included from :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | co/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nst int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15In file included from : warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 22 warnings generated when compiling for gfx90a. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp_:SI2M: PLE, In file included from 4) /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h :| 11^: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hinitializer order does not match the declaration order [-Wreorder-ctor]: 611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pr o670t | o , u ntriodl(lt>i(d)).,r unn(t)h;r e\a d s| ^( nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hx:.670x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_670 | tid(tid), nt h671r | e a d s (snttehprSeiazdes()s,t etpiSdiIzneBl_o c=k(=t h0r e?a dnIcdcxl.Sxh),m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~~~~~~~[ NCCL_PROTO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h_:S670I:M60P:L Enote: ]field 'group' will be initialized after field 'stepSize'/ NCCL_STEPS/si z670e | o f ( Tt)i d:( tsitde)p,S inzteh_r)e a{d s (| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Pro/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hd_b:f6701:615_:4 ,warning: initializer order does not match the declaration order [-Wreorder-ctor]n cclFuncAllReduce, FuncProd, hip_ b670f | l o a t 1t6i,d (NtCiCdL)_,A LnGtOh_rTeRaEdEs,( nthNrCeCaLd_sP)R,O TtOi_dSIInMBPlLoEc,k (4t)h r e| a^d Idx.x), group(group), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~611 : 62| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ note: expanded from macro 'DEFINE_ncclDevFunc' 671611 | | sRtuenpWSoirzkeB(asttcehpm,e ma.lcgoom,m .pbruoftfoS,i zuensr[oNlClC>L(_)P.RrOuTnO(_)S;I M\P L E| ] ^/ NCCL_STEPS/sizeof(T) : stepSize_) { /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :| 670 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 15 :| group(groupnote: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h):,303 :n90t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herea ds(nthreads), t303i | d I n B l o cPkr(itmhirteiavdeIsd, /* D670i | r e c t =t*i/d0(,t iPdr)o,t on,t h0r>e apdrsi(mnst h r| e ^a ds), tidInBlock/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h(:t565h:r5e:a dnote: Iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hered x.x), group( g565r | o u p ) ,r u n| T ^~~~~~~~~~~r eeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here ste 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ pSiz/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.he(ste:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | uint64_tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ * ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'bid' [-Wunused-variable] 218 | const int bid =In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx942. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 22 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | co/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: In file included from unused variable 'bid' [-Wunused-variable]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, f/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hlag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 27 | const int bid = ncclShmem.cha/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nnelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxIn file included from .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROT/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Pro18 warnings generated when compiling for gfx908. d_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthre | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | In file included from const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ expanded from macro 'barrier_by_group' 29 | constIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ a2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | cons/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ t int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[N/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :In file included from 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ dOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15fSizes[NCCL_PROTO_SIMPL:E]/NCCL_STEPS/sizeof(T) : stepSi ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:warning: initializer order does not match the declaration order [-Wreorder-ctor]303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hs), tidInBlock(threadIdx:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMP.LxE), ,2 )g r o| u^p (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | 671 | R u nsWtoerpkSBiaztec(hsl,S hamlegmo.,c opmrmo.tbou,f fuSnirzoelsl[>N(C)C.Lr_uPnR(O)T;O _\S I M| P ^L E]/NCCL_STEPS/sizeof(T) : stepSi/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hz:e670_:)15 :{ note: field 'nthreads' will be initialized after field 'tidInBlock'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nthreads/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h):,63 :t56i:d Inote: nBin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested herel ock(threadIdx .63x | ) , g rPoruipm(igtriovueps)<,T , | R ^~~~~~~~~~~~~~~~~e dOp, FanS/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hy:m670m:e60t:r inote: cfield 'group' will be initialized after field 'stepSize'< 1>, 0, Prot o670, | 0 > ptriidm(st i d| ) ^, nthreads(nth/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hr:e558a:d5s:) ,note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested heret idInBlock( t558h | r e a d Irduxn.Rxi)n,g (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[N stepSizCe(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group CL_PROTO_SIM: note: Pexpanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:LE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56 : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here tid(tid), nthrea d63s | ( n t h rPeraidmsitiv)e,s r,o u0p,( grPoruopt)o,, 0| > ^~~~~~~~~~~ prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_S:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Pri/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, su:670:15: btwarning: initializer order does not match the declaration order [-Wreorder-ctor] n, work); 670 | tid | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buff/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hid:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Prim(tid), nthreads(ntitives, /*Dreirect=*/0, Proto, 0>ad prims | ^ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:118: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 warnings generated when compiling for host. | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = nccIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ E; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLLo>up), | ^~~~~~~~~~~ (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here eads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNR/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tiOdInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeoLfL(>T()t i:d ,s tnetphSriezaed_s), {w o r| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k ) ;| group(group | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h: :note: 432:in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here78 : note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 254 | P r432i | m i t i v e si,, P/r*oDtior,e cCtO=L*L/_0U,N RPOrLoLt>o(,) .0r>u np(rtiimds, s| u ^ btn, work); /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h| : ^ 565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 565 | runTreeUpDo w17n | _,P rCoOdL_Lf_6U4N_R4O,L Ln>c(ctliFdu,n cnAtlhlrReeaddusc,e ,w oFrukn)c;P r o| d ^, double, NCCL_ALGO_TREE, NCC/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hL:_432P:R78O:T Onote: _in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereS IMPLE, 4) | ^ 432 | if (tid /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h<: 611s:u62b:t nnote: )expanded from macro 'DEFINE_ncclDevFunc' RunWorkColl>(,) .arlugno(,t ipdr,o tsou,b tunn,r owlolr>k());. r u| n ^( ); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 17 | DEFINE_ncclDev F670u | n c ( A ltliRde(dtuicde)_,T RnEtEh_rSeIaMdPsL(En_tPhrroeda_dfs6)4,_ 4t,i dnIcncBllFoucnkc(AtlhlrReeadduIcdex,. xF)u,n cgPrrooudp,( gdroouubpl)e,, N| C ^~~~~~~~~~~~~~~~~C L_ALGO_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hT:R670E:E60,: Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_PROTO_SIMP L670E | , 4 ) t i| d^( tid), nthre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.ha:d611s:(62n:t hnote: rexpanded from macro 'DEFINE_ncclDevFunc'e ads), tidIn B611l | o c k ( tRhurneWaodrIkdBxa.txc)h,< cgorlolu,p (tgyr,o urpe)d,o p <| t ^~~~~~~~~~~y >, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>(c)o.mrmu.nb(u)f;f S\i z e| s ^[ NCCL_PROTO_SIMPLE]/NC/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hC:L670_:S15T:E Pnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'/ sizeof(T) : ste p670S | i z e _ )t i{d (t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | n group(groupt hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h: 63670: | 56 : note: tiin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested hered (tid), nthreads(n t63h | r e a d sP)r,i mtiitdiIvneBslr,o u0p,) ,P r o| t ^~~~~~~~~~~o , 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warningsIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ 18 warnings generated when compiling for gfx1101. | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp2:: 2In file included from : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h11:: 11In file included from : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h175:: 175/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h: :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h271::19271:: 19warning: :unused variable 'ptr' [-Wunused-variable] warning: unused variable 'ptr' [-Wunused-variable] 271 | 271 | u i n t 6 4 _uti*n tp6t4_t* ptr = recvPtr(0)+ll128Offset; | ^~~ r = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp254::290: :In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hnote: :11in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 254 | 670 | tid(tid), nthreads(nthrPeriamdsi)t,i vetis, /*D i671 | r e ct =s*t/0e,pS Pizroe(tos,t ep0S> izpre_im s== | ^0 ? ncclShmem.comm./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hb:u565ff:5Si:z note: esin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here [NCCL_PR OT565 | O _S IM rPLunE]T/NrCeeCLU_pSDoTEwnPS, COLL_UNROLL>(t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hi:d254,:90 n: thnote: rin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested heree ads, w o254 | r k) ; | ^P rimitives, 0, 2, 2>::run' requested herea nAsym me432 | tr ic < N CCifL_ (MAtiXd_ DE,n W/or*DkCiroelcltA lprgoi,m sP ro| t ^o, COLL/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:_U565:NR5O:L note: L>in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here ().run 565( | t id ,r sunubTtrene,U wpDorokwn)<; T, | ^Re dOp, P/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cppro:7t:o1S:i note: mpin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested herele <1, 1, 7C | ODLELF_UINNER_OnLLcc>l,De CvFOuLncL_(AUlNlRROLedLu>(cet_idTR, EnEt_ShrIMeaPdLsE,_P wrod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTOIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group _SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here, 1>, /*Direct=*/0 ,254 | P r o t o , P0r>i mpirtiimvse s <| T ^, RedOp, FanAsymmetri/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hc:<565N:C5C:L _note: Min instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereA X_DEV_ARITY, 5651 | > , / *rDuinrTercete=U*p/D0o,w nPp ,p rPirmost o S| i ^m ple<1, 1, CO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hL:L565_:U5N:R Onote: Lin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereL >, COLL_UNRO L565L | > ( t i dr,u nnTtrhereeUapdDso,w nw432,: 78C:O Lnote: Lin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here_ UNROLL>(tid, n432t | h r e a d s ,i fw o(rtki)d; < | s ^u btn) RunWorkColl, 0, 2, 2>::run' requested hereo , Proto, COLL _432U | N R O L L > (i)f. r(utni(dt i, 0, 2, 2>::run' requested hereo , COLL_UNROLL>() .7r | uDnE(FtIiNdE,_ nscucbltDne,v Fwuonrck()A;l l R| e ^d uce_TREE_SIMPLE_Prod_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cppf:87_:21,: nnote: cin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested herec lFuncAllReduce, F7u | nDcEPFrIoNdE,_ nrccccllD_efvlFouantc8(,A lNlCRCeLd_uAcLeG_OT_RTEREE_ES,I MNPCLCEL__PPrRoOdT_Of_8S_I2M,P LnEc,c l2F)u n c| A^l lReduce, Func/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hP:r611o:d62,: rnote: cexpanded from macro 'DEFINE_ncclDevFunc'c l_float8, NC C611L | _ A L G OR_uTnRWEoEr,k BNaCtCcLh_^, algo, proto, un/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hr:o611l:l62>:( )note: .expanded from macro 'DEFINE_ncclDevFunc'r un(); \ | ^ 611 | RunWorkBatch, a l670g | o , p rtoitdo(,t iudn)r,o lnlt>h(r)e.ardusn((n)t;h r\e a d| s ^) , tidInBlock(threadId/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hx:.670x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(group), | ^~~~~~~~~~~~~~~~~ 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ht:i670d:(60t:i dnote: )field 'group' will be initialized after field 'stepSize', nthreads(n t670h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~a dIdx.x)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h,: 670g:r60o:u pnote: (field 'group' will be initialized after field 'stepSize'g roup), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIdx.x), group(group), | ^~~~~~~~~~~ :670::565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, wor/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPL:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreadsE), tidInBl_oPck(rod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_threadIdx.x),ALGO_TREE, NC groCL_PROTO_SIMPLEup(gr, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl ( ) . rRuunn(Wtoirdk,B astucbht, algo, pro/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cppt:o17,: 1u:n rnote: oin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested herel l>().run(); \ | ^17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h_:f6708:_154:, note: ncfield 'nthreads' will be initialized after field 'tidInBlock'c lFuncAllReduce, FuncProd, r670c | c l _ f ltoiadt(8t,i dN)C,C Ln_tAhLrGeOa_dTsR(EnEt,h rNeCaCdLs_)P,R OtTiOd_ISnIBMPlLoEc,k (4t) h r| ^e adIdx.x), grou/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hp:(611g:r62o:u pnote: )expanded from macro 'DEFINE_ncclDevFunc', | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h611: | 670 : 60 : Rnote: ufield 'group' will be initialized after field 'stepSize'n WorkBatch),, anltghor,e apdrso(tnot,h ruenardosl)l,> (t)i.drIunnB(l)o;c k\( t h| r ^e adIdx.x), group(gr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ho:u670p:)15,: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Prim).run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ itives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx90a. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp 22 warnings generated when compiling for gfx90a. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 22 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gro/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | baIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, u Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL> warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group):670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670::15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OffsetIn file included from ; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from 29 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp : 2 : In file included from c/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.ho:n11s: tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hi:n174t: w/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h =: 145t:h14r:e awarning: dIunused variable 'data1' [-Wunused-variable]d x.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 27 | 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AlIn file included from l/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cppR:e2du: cIn file included from e_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hR:I11N: GIn file included from _/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hS:I173M: PL/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hE_:P670r:o15d:_ uwarning: 6initializer order does not match the declaration order [-Wreorder-ctor]4 _2, ncclFuncAllReduce, FuncProd, uint64_t, NC C670L | _ A L G Ot_iRdI(NtGi,d )N,C CnLt_hPrROeTaOd_sS(InMtPhLrEe,a d2s)) , | t^i dInBlock(threadIdx.x),/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h :g611r:o62u:p (note: gexpanded from macro 'DEFINE_ncclDevFunc'r oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 611 tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | RunWorkBatch< c671o | l l , tsyt,e prSeidzoep(p,S iazlego_, =p=r o0t o?, nucncrloSlhlm>e(m)..croumnm(.)b;u \f f S| i ^z es[NCCL_PROTO_SIMPLE]/NCCL_STEP/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hS:/670s:15i:z enote: ofield 'nthreads' will be initialized after field 'tidInBlock'f (T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~670 | | group(group tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hp:(254g:r90o:u pnote: )in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :254 | 670 : 60 : note: field 'group' will be initialized after field 'stepSize'P rimitivesl,o c/k*(Dtihrreecatd=I*d/x0.,x )P,r ogtroo,u p0(>g rporup)i,ms | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ epSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl()In file included from .run(tid, subtn, work); | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:unused variable 'w' [-Wunused-variable]5: 1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 755 | | D E F I N Eb_anrcrcileDre_vbFyu_ngcr(oAulpl(R)e;d u c| e ^~~~~~~~~~~~~~~~~~ _TREE_LL128_P/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hr:o29d:_15u:6 4note: _expanded from macro 'barrier_by_group'2 , ncclFuncA l29l | R e d u ccoen,s tF uinntc Pwr o=d ,t hurienatd6I4d_xt.,x /NWCACRLP__ASLIGZOE_;T R\E E ,| ^N CCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from 145 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp : 2 : In file included from u/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hi:n11t: 3In file included from 2/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h_:t174 : d/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hat:a1451:,14 :f lwarning: aunused variable 'data1' [-Wunused-variable]g 1, data2, flag2; | 145 ^~~~~ | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h : 145u:i35n:t 3warning: 2unused variable 'flag2' [-Wunused-variable]_ t data 1145, | f l a gu1i,n td3a2t_at2 ,d aftlaa1g,2 f;l a g| 1 ^~~~~, data2,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h :f145l:a21g:2 ;warning: unused variable 'flag1' [-Wunused-variable] | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ , ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bi/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ d = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid In file included from =/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cppa:l2g: In file included from o/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h,: 11p: rIn file included from o/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.ht:o174,: u/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hnr:o75l:l7>:( )warning: .unused variable 'w' [-Wunused-variable]r un(); \ | ^ 75 | barrier_b/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hy:_670g:r15o:u pnote: (field 'nthreads' will be initialized after field 'tidInBlock') ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h :67029 | : 15 : note: texpanded from macro 'barrier_by_group'i d(tid), nt h29r | e a d s (cnotnhsrte aidnst) ,w t=i dtIhnrBelaodcIkd(xt.hxr/eWaAdRIPd_xS.IxZ)E,; g\r o u| p ^( group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>()./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlrun(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670ock(:t15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadshreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_R/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | PrimitiveING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.headIdx.x), gro:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ up(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algoadIdx.x), group(group), | ^~~~~~~~~~~ , proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 22 warnings generated when compiling for gfx90a. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1101. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Off/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hset; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelIdIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp::22: : In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h11:: 11In file included from : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h175: :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h175: :271/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:19::80 :warning: 5unused variable 'ptr' [-Wunused-variable]: warning: unused variable 'w' [-Wunused-variable] 271 | 80 | u i n t6 4b_atr*r ipetrr_ =b yr_egcrvoPutpr(()0;) + ll| 1 ^~~~~~~~~~~~~~~~~~2 8Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < suIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ btn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives175,: //builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h*:D80i:r5e:c twarning: =*unused variable 'w' [-Wunused-variable]/ 0, Proto, 0> prim s80 | | ^ barrier_by_g/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hr:o565u:p5(:) ;note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here | ^~~~~~~~~~~~~~~~~~ 565 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h : 29: 15 r:u nnote: Texpanded from macro 'barrier_by_group'r eeUpD o29 | w n< T ,c RonesdOt pi, ntPr wot =oS ithmprleae\ , | C ^OL L_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) Ru/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'barrier_by_group' 29 | nWorkColl().run(tid, subtn, work); | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 670 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h== 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/:218:15: warning: unused variable 'bid' [-Wunused-variable]N C218 | Cconst int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ L_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groupIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ (group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepS/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hiz:e670_:)15 :{ warning: | initializer order does not match the declaration order [-Wreorder-ctor] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nth/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hr:e303a:d90s:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested heret idInBlock(threadIdx. x)303, | g ro u p ( gProruipm)i,t i v| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s <| T tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ , RedOp, FanAs 671y | m m e t rsitecpme,m ./c*oDmimr.beucfftS=i*ze/s0[,NC CPLr_oPtRoO,TO _0SI>M PpLrEi]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),m st i d| I ^nB lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5 :671 | note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here stepSize(stepSize_ =565= | 0 ? rnucncTlSrhemeeUmp.Dcoowmnm<.Tb,u fRfeSdiOzpe,s [PNrCoCtLo_SPimRpOlTeO<_S1I,MP L1E,] /CNOCLCLL__USNTREOPLSL/>s, iCzOeLoLf_(UTN)R O:L Ls>t(eptSiidz,e _n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s , | group(groupw ork); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: 432 | in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here if (tid 63< | s u b tPnr)i mRiutinvWeosr,, P0r,o tPor,o tCoO,L L0_>U NpRrOiLmLs > (| ) ^. run(tid, sub/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.ht:n558,: 5w:or knote: )in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here; | ^ 558 | runR/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cppi:n17g:<1T:, note: Rin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested heree dOp, Proto, COLL_UNRO L17L | >D(EtFiIdN,E _nntchcrleDaedvsF, uwnocr(kA);l l R| e ^d uce_TREE_SIMPLE_Sum_b/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hf:14326:_784:, note: nin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herec clFuncAllReduce ,432 | F u n c S u mi,f h(itpi_db f().run(t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hid:,611 :s62u:b tnote: nexpanded from macro 'DEFINE_ncclDevFunc', work); | ^ 611 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cppR:u22:n1W:o rnote: kin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereB atchE,_ naclclgDeov,F unpcr(oAtlol,Re duuncreo_lRlIN>G(_)S.IrMuPLnE(_)S;u m\_ b f| 1 ^6 _4, ncclFuncAllReduce, Func/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hS:um670,: 15h:i pnote: _field 'nthreads' will be initialized after field 'tidInBlock'b float16, NCCL_ALGO _670R | I N G , tNiCdC(Lt_iPdR)O,T On_tShIrMePaLdEs,( n4t)h r e| a^d s), tidInBlock(th/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hr:e611a:d62I:d xnote: .expanded from macro 'DEFINE_ncclDevFunc'x ), group(g r611o | u p ) ,R u n| W ^~~~~~~~~~~~~~~~~o rkBatch, a l670g | o , p rottiod,( tuindr)o,l ln>t(h)r.eraudns(()n;t h\r e a| ^d s), tidInBlock(threa/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hd:I670d:x15.: xnote: )field 'nthreads' will be initialized after field 'tidInBlock', group(group), 670 | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work);/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11 | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:< subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | >().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthrea tidds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 670 | 671 | t i ds(tteipdS)i,z en(tshtreepaSdisz(en_t h=r=e a0d s?) ,n ctcildSIhnmBelmo.ccko(mtmh.rbeuafdfISdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ izes[N C671C | L _ P R OsTtOe_pSSIiMzPeL(Es]t/eNpCSCiLz_eS_T E=P=S /0s i?z enocfc(lTS)h m:e ms.tceopmSmi.zbeu_f)f S{i z e| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~[ N C| C group(groupL _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(step/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hSize_: =670= :015 :? nwarning: ccinitializer order does not match the declaration order [-Wreorder-ctor]l Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC CL670_ | S T E P St/isdi(zteiodf)(,T )n t:h rsetaedpsS(inzteh_r)e a{d s )| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, t i| group(groupd InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h671: | 303 :90 : note: stin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested heree pSize(stepSize_ 303= | = 0 ? nPcrcilmSihtmievme.sci,z /e*oDfi(rTe)c t:= *s/t0e,p SPirzoet_o), {0 > | p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r i m| s group(group | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h : 63 :r56u:n Tnote: rein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested heree UpDown,m mCeOtLrLi_cUO,LL >0(,t iPd,r onttoh,r e0a>d sp,r iwmosr k )| ; ^ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 558 | runRing< T432, | R e d O p ,i fP (rtoitdo ,< subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , 0, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInProto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hroup), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uin uint32_t y, head, mtantissa; | ^ 32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 22 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group();271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp145::214: :In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hwarning: :11unused variable 'data1' [-Wunused-variable]: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 145 | uint32_t da ta751 | , f l a g1, bdaartrai2e,r _fblya_gg2r;o u p| ( ^~~~~) ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: 145note: | expanded from macro 'barrier_by_group' uint32_t d a29ta | 1 , f l acgo1,n sdta tian2t, wfl a=g 2t;h r | e ^~~~~a dIdx/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h.:x145:/28W:A Rwarning: Punused variable 'data2' [-Wunused-variable]_ SIZE; \ | 145 ^ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_In file included from by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hreadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thrIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSeize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hadIdx.x), group(group), | ^~~~~~~~~~~ :254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, //builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_S*Dium_f16_2, ncclFuncAllReducrect=*/0, Proto, 0> prims | ^ e, FuncSum/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565, ha:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here l 565 | f, NCC runTreL_eALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCLIn file included from _ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15 tid(t: id), nthreads(warning: ninitializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | th tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h s:t670e:p15S:i zwarning: e_initializer order does not match the declaration order [-Wreorder-ctor]) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nthreads)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h,: 63t:i56d:I nnote: Bin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested herel ock(threadI d63x | . x ) , Pgrriomuipt(igvreosu, 0 ,671 | P r o t os,t e0p>S ipzrei(msst e p| S ^i ze_ == 0 ? ncc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hl:S558h:m5e:m .note: cin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested hereo mm.buffSiz e558s | [ N C C Lr_uPnRROiTnOg_:( tsitde,p Snitzher_e)a d{s , | w ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o r k| ) group(group; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h::7863:: 56note: :in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 432 | 63 | Pirfi m(ittiid v R, e0dO,p P,r Aoltogo, ,0 P>r oprtio,m s C O| L ^L _UNROL/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hL:>558(:5).: runote: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested heren( tid , 558s | u b t n,r unwRorikn);g< T| , ^ RedOp, P/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cppro:12t:o,1: C note: OLin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereL _UNROL L>12 | (tDiEdFI, NnEt_hncreclaDdesv,F uwoncrk(A);ll R| ed ^u ce_RING_SI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:M432P:L78E_: Snote: umin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here _f16_2, 432 | n c cl Fu nicf Al(ltiRed du) ( )| .^r un(tid, /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hs:u611:bt62n: , note: expanded from macro 'DEFINE_ncclDevFunc'w ork); 611 | | ^ RunWork/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cppBa:t12c:1h<: cnote: olin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herel , ty, r 12ed | DopE_n,c acllgDeov,F uprncot(Aol,l Ruenrduolcel_>(R)IN.Gru_SnI()MP; L\E_ S u| ^m_ f16_2, ncclFuncAllRe/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hd:u670c:e15:, Fnote: field 'nthreads' will be initialized after field 'tidInBlock'un cSum, 670ha | l f , tNCidC(L_tiAdL)GO, _nRtINhrG,e adNCs(CLn_thPRrOeaTOds_)S,IM PtiLEdI, n2B)lo c| k(^ thread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hI:dx611.:62x:) , note: expanded from macro 'DEFINE_ncclDevFunc'gr oup(g 611ro | up ) , R| ^~~~~~~~~~~~~~~~~u nWo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:rk670:Ba60t:c note: hd,), anltgohr, eaprdsot(no,t uhnrreoladl>s)(), .rtuidn(I)n;B l\o c| k( ^ threadI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hd:x670.x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(gr 670o | u p) , t| id ^~~~~~~~~~~( tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1030. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cppr:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.he:11: In file included from cvPtr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h(0)+ll128Offset; | ^~~ :271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ clShmem.channelId - work->channelLo;/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ *Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreaIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tcclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ epSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gup),roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEtid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), ti 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here dInBlock(threadIdx.x), group(grou565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, flp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sioat, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unzeof(roll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' T) : stepSize_ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpD/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ho:w670n:<15T:, warning: RedOpinitializer order does not match the declaration order [-Wreorder-ctor], ProtoSimple<1, 1, COLL_UNROL L670> | , C O LtLi_dU(NtRiOdL)L,> (nttihdr,e andtsh(rnetahdrse,a dwso)r,k )t;i d I| n ^B lock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.ho:u432p:)78,: note: | in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 432 | 671 | i f (sttiedp SM(P)L.Er]u/nN(CtCiLd_,S TsEuPbSt/ns,i zweoorfk()T;) :| ^s tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp :17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hN:E254_:n90c:c lnote: Din instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested heree vFunc(AllReduce _254T | R E E _ S I MPPrLiEm_iStuimv_efs3<2T_,4 ,R endcOcpl,F uFnacnAAlslyRmemdeutcrei,c L,G O/_*TDRiErEe,c tN=C*C/L0_,P RPOrToOt_oS,I M0P>L Ep,r i4m)s | | ^ ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h::5611:: 62note: :in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here note: expanded from macro 'DEFINE_ncclDevFunc' 565611 | | rRuunnTWroerekUBpaDtocwhn<p,l ea>,( )C.OrLuLn_(U)N;R O\L L >| ( ^t id, nthreads, work/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h):;670 : 15| : ^ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h(:t432i:d78):, note: nin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested heret hreads(nthrea d432s | ) , t i d IinfB l(otcikd( t ( ) . rtuind((ttiidd,) ,s unbtthnr,e awdosr(kn)t;h r e| a ^d s), tidInBlock(threa/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cppd:I17d:x1.:x )note: ,in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here group(group )17, | D E| F ^~~~~~~~~~~I NE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 22 warnings generated when compiling for gfx90a. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp 22 warnings generated when compiling for gfx90a. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int :b366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ id = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_bIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tiIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ d(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtIn file included from , algo, proto, unroll>().run(); \ | ^ n, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, n | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: cclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEfP(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId565 | runTreeUpDown< T-, wReodrOkp-,> cPhraontnoeSliLmop;le <1 , | 1 ^~~, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h::11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:1175:7: warning: unused variable 'w' [-Wunused-variable] In file included from : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | bar/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: rIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRedIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work-u>ce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ chann/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ elLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, P/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: rIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: owarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t tid(tid), ntohreads(nt,hreads), tidInBlock(threadIdx.x), group(gr oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(Idgroup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tx.id), nx)threads(nthreads), tidInBlock(threadId, group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | x.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFuncSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ (/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cppAllReduce:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ _TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from <1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.bufads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ fSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkC:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx90a. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ o; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | ba145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | ta1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Lo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80::5: warning: unused variable 'w' [-Wunused-variable] 2 80: | In file included from barrie/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hr_b:y_grou11p(); : | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : warning: unused variable 'bid' [-Wunused-variable] 366 | const i/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nt bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | PrimitIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < suives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here btn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.com/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hm.buffSizes[NCCL_PROTO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here _SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t 558 | id(tid), nthread rusnRing(ti,d, nthreads group(group), , wor| k); | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepS/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herei 432 | z if e_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRed(tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nth/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hreads, work:670):15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ;670 | t id(tid), nt hreads(nthr| eads), tidIn ^Block(threa dIdx.x), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hgroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlo:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? n/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested herecclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccliShdInBlock(threamdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid,em nthreads, .work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1030. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ 7 | DEFINE_n/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hc:c611l:D62e:v Fnote: uexpanded from macro 'DEFINE_ncclDevFunc'n c(AllReduce _611T | R E E _ SRIuMnPWLoEr_kSBuamt_cuh6<4c_o2l,l ,n ctcyl,F urnecdAolplu,c ea,l gFou,n cpSruomt,o ,u iunntr6o4l_lt>,( )N.CrCuLn_(A)L;G O\_ T R| E ^E , NCCL_PROTO_SIMPL/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hE:,670 :215): note: | field 'nthreads' will be initialized after field 'tidInBlock'^ 670 | tid(ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hd:)611,: 62n:t hnote: reexpanded from macro 'DEFINE_ncclDevFunc'a ds(nthreads), t611i | d I n B lRoucnkW(otrhkrBeaatdcIhd , | a ^~~~~~~~~~~~~~~~~l go, pro/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ht:o670,: 60u:n rnote: ofield 'group' will be initialized after field 'stepSize'l l>().run(); \670 | | ^ tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hi:d670I:n15B:l onote: cfield 'nthreads' will be initialized after field 'tidInBlock'k (threadIdx.x), g670r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~n threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hexpanded from macro 'DEFINE_ncclDevFunc' :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 611 | RunWorkBatchd,) ,a lngtoh,r epardost(on,t hurneraodlsl)>,( )t.irduInn(B)l;o c\k ( t| h ^r eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 671 | s670t | e p S i ztei(ds(tteipdS)i,z en_t h=r=e a0d s?( nntchcrleSahdmse)m,. ctoimdmI.nbBulfofcSki(ztehsr[eNaCdCILd_xP.RxO)T,O _gSrIoMuPpL(Eg]r/oNuCpC)L,_ S T| E ^~~~~~~~~~~~~~~~~P S/sizeof(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hT:)670 ::60: snote: tfield 'group' will be initialized after field 'stepSize'e pSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~670 | | group(group tid(tid), nthreads(nthreads), tidI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hn:B303l:o90c:k (note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereh readIdx.x), gr o303u | p ( g r o u pP)r,i m i| t ^~~~~~~~~~~i ves, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ <1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group tepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threaroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(t id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1818 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 22 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cppIn file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrie/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cppr_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_groupIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work-/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ >channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29::218:15: warning: unused variable 'bid' [-Wunused-variable] 15: note: expanded from macro 'barrier_by_group' 218 | const int bid = ncclShmem .channelId - work->channelLo; | ^~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h.:x432),: 78:g ronote: uin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herep( group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 432 | 671 | i f s(tteipdS iL(E)]./rNuCnC(Lt_iSdT,E PsSu/bstinz,e owfo(rTk)) ;: s| t ^e pSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hl:D303e:v90F:u nnote: cin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here( AllReduce_RING_SI M303 | P L E _ S u mP_rui8m_i2t,i venscI, N/G*,D iNrCeCcLt_=P*R/O0T,O _PSrIoMtPo,L E0,> 2p)r i m| s^ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h::611565::625:: note: note: expanded from macro 'DEFINE_ncclDevFunc'in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 611 | 565 | RruunnWTorrekeBUaptDcohwS,i mapllgeo<,1 ,p r1o,t oC,O LuLnr_oUlNlR>O(L)L.>r,u nC(O)L;L _\U N R| O ^ LL>(tid, nthreads, work);/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h : 670| : ^15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 670/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h | : 432 : 78 :t inote: din instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here( tid), nthreads(n t432h | r e a d s) , itfi d(ItniBdl o() .r670 | un ( t i dt, isudbt(n, twiodrk)),; nt| hr ^e ads(nth/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cppr:e17:a1:d s)note: ,in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here tidInBl o17 | cDEkF(ItNhE_renaccdIldDexv.Fxun)c(,A llgrRoeudp(ugrocuep_)TR,EE _S I| M ^~~~~~~~~~~P LE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: tin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threIn file included from adIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work-:>/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId 29:c15: hnote: expanded from macro 'barrier_by_group'a -n work->channelLo; | ^~~ 29n | e l cLonost; in t w | = ^~~th readIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271In file included from | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, f/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ lag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cppIn file included from :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARPIn file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp :2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h: 27:15: warning: unused variable 'bid' [-Wunused-variable] | ^~~~~27 | c onst in/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.ht bid = ncclShmem.channelId - work->cha:nnelLo; 145| ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:35::218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:e15: warning: adinitializer order does not match the declaration order [-Wreorder-ctor] s), tidInB 670 | tid(tid), nlthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.ock(threadIcomm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group dx.x), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads):670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, PIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670roto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(ti | d , s ub ttni, dw(otrikd);) , | n ^t hreads(nthreads)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp,: 7t:i1d:I nnote: Bin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested herel ock(threadIdx.x), 7g | rDoEuFIpN(Eg_rnocuclpD)e,v F u| n ^~~~~~~~~~~c (AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL:670:15:_ALGO_TREE, NCCL_PROTO_SIMPwarning: LE, 2) | ^ initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611 670:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Pri:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(Al:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx908. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t datIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h::229: :In file included from 15/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:: 11note: : expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h: 8029: | 5 : warning: counused variable 'w' [-Wunused-variable]n st int w = threadIdx.x/WARP_SIZ E80; | \ | b ^ arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | coIn file included from nst int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>()*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepS/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 670/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h::15670::15 : warning: note: field 'nthreads' will be initialized after field 'tidInBlock'initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads( n670t | hr e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~a dIdx.x)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h,: 670g:r60o:u pnote: (field 'group' will be initialized after field 'stepSize'g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 670| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ tid(tid), nth r671e | a d s ( nsttherpeSaidzse)(,s tteipdSIinzBel_o c=k=( t0h r?e andcIcdlxS.hxm)e,m .gcroomump.(bgurfofuSpi)z,e s [| N ^~~~~~~~~~~C CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thIn file included from readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepS/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDivi_iz8e_(4n,c cnlccSlhFmuenmc.AclolmRme.dbuucfef,S iFzuensc[SNuCmCPLo_sPtRDOiTvO,_ LiLn1t288_]t/,N CNCCLC_LS_TAELPGSO/_sTiRzEeEo,f (NuiCnCtL6_4P_RtO)T)O _{S IM P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E , | 4 group(group) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchnote: , in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested herea lgo, proto, unro l63l | > ( ) . rPurni(m)i;t i\v e s| < ^T , RedOp, FanSymme/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ht:r670i:c15<:1 >note: ,field 'nthreads' will be initialized after field 'tidInBlock' 0, Proto, 0> pr i670m | s | ^t id(tid), nthre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.ha:d1062s:(5n:t hnote: rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested heree ads), tidI n1062B | l o c k (rtuhnrReiangd/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h(:t670i:d60,: nnote: tfield 'group' will be initialized after field 'stepSize'h reads, work); | ^ 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hs):,432 :t78i:d Inote: nin instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested hereB lock(threadIdx.x), g r432o | up (g r ou p ) ,i f | ^~~~~~~~~~~( tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_S/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ umPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: 670 | initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), nthreads(nthreads), t i670d | I n B l otcikd((tthirde)a,d Indtxh.rxe)a,d sg(rnotuhpr(egardosu)p,) ,t i d| I ^~~~~~~~~~~n Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | bant32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | In file included from ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h75 | bar:rier_by_grou11p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h: :29:15: note: expanded from macro 'barrier_by_group'In file included from 29 | co/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hnst int w =: threadIdx.175x/WARP_SIZ: E; \In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = re | ^ In file included from cvPtr(0)+ll128Offset/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->cIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from hannelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here O_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.c/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto,omm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AlldOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch ,n tahlrgeoa,d sp(rnotthor,e aundrso)l,l >tidInBlock(threadIdx.x), group((g)ro.urpu)n,( )| ; ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ \| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | ^ 671 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h : 670 : 15s:tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1102. 22 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.chanIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:In file included from warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, fl:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ag1, data2, fl/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :: 670s:t15e:p Swarning: izinitializer order does not match the declaration order [-Wreorder-ctor]e _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nt/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hh:r303e:a90d:s )note: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here tidInBlock(thr e303a | d I d x . x )P,r igmriotuipv(egsrt,e p/S*iDzier_e c=t== *0/ 0?, nPcrcoltSoh,m e0m>. cpormimm.sb u f| f ^S izes[NCCL_PROT/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hO:_565S:I5M:P note: Lin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hereE ]/NCCL_STEPS /565s | i z e o fr(uTn)T r:e esUtpeDpoSwinz, COLL_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hUN:R303O:L90L:> (note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herei d, nthreads, w o303r | k ) ; | ^P rimitives, 0, 2, 4>::run' requested herec <1, NCCL_MAX_ D432E | V _ A R I T Yi>f, (/t*iDdi rl lp, ProtoSimple<1, 1, 4>, 4>' requested hereL _UNROLL>().run (565t | i d , sruubntTnr,e ewUoprDko)w;n < T| , ^ RedOp, ProtoSimple<1/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp,: 171:,1 :C Onote: Lin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereL _UNROLL>, COLL_UNR O17L | LD>E(FtIiNdE,_ nnctchlrDeeavdFsu,n cw(oArlkl)R;e d| u ^c e_TREE_SIMPLE_SumPostDiv_u/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h6:4432_:478,: nnote: cin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herec lFuncAllReduc e432, | F u n c S uimfP o(sttiDdi v<, suuibnttn6)4 _Rtu,n WNoCrCkLC_oAlLlG/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h():.611:ru62:n (tnote: iexpanded from macro 'DEFINE_ncclDevFunc'd , sub t611 | n , wo rRukn);Wo rk| ^B atch, 0, 2, 4>::run' requested here redop17, | DaEFlgINoE,_ ncprcoltoD, eunvrFuolnlc(>A(l)l.rReundu(c)e; _T\ R EE| ^_ SIMPLE/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h_:Su670:mP15:o snote: tDfield 'nthreads' will be initialized after field 'tidInBlock'i v_u6 4670_ | 4, n cctliFdu(nticdAl),lR endtuche,r FeuandcsS(unmtPhoresatdsD)i,v, tiudiInntBl6oc4k_(tth, reNaCCdILd_xA.xL),G gOr_oTuRpEE(g,r oNuCpC),L _ P| ^~~~~~~~~~~~~~~~~RO T/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hO_:670S:IM60:P Lnote: field 'group' will be initialized after field 'stepSize'E , 4) 670| | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.ht:i611d:(62:t note: iexpanded from macro 'DEFINE_ncclDevFunc'd) , nt h611re | ad s (nRutnhWreoarkdsBa)t, cth.x,) ,a lggor,o uppr(gotroo,up u),n r| o ^~~~~~~~~~~l l>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. 22 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(In file included from ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLoIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp :6712 | : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h : 11s: tIn file included from e/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hp:S173i: ze/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h(s:t670e:p15S:i zwarning: einitializer order does not match the declaration order [-Wreorder-ctor]_ == 0 ? ncclShmem.comm.buffSizes[NCC L670_ | P R O T Ot_iSdI(MtPiLdE)],/ NnCtChLr_eSaTdEsP(Sn/tshirzeeaodfs()T,) t:i dsItneBplSoiczke(_t)h r{e a d| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d x .| x group(group) , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.he:m254.:c90o:m mnote: .bin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereu ffSizes[NCCL_P R254O | T O _ S I M PPLrEi]m/iNtCiCvLe_sS, /*Direct=*/0, Proto, 0> prims /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h :| 254 ^: 90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: 254in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here | Primitiv e565s | < T , RreudnOTpr,e eFUapnDAoswynmC,O L/L*_DUiNrReOcLtL=>*,/ 0C,O LPLr_oUtNoR,O L0L>> (ptriidm,s n t| h ^r eads, work/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h):;565 : 5| : ^ note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.he:e432U:p78D:o wnote: nin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here< T, RedOp, Prot o432S | i m p l e if (tid< <1 ,s u1b,t nC)O LRLu_nUWNoRrOkLCLo>l,l p(,t iAdl,g on,t hPrreoatdos,, CwOoLrLk_)U;N R O| L ^L >().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 432 | if (tid < s7u | bDtEnF)I NREu_nnWcocrlkDCeovlFlu2(,) .nrcucnl(Ftuindc,A lsluRbetdnu,c ew,o rFku)n;c S u| m ^P ostDiv, uint8_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cppt:,7 :N1C:C Lnote: _in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested hereA LGO_TREE, NCCL_PROT O7_ | SDIEMFPILNEE,_ n2c)c l D| e^v Func(AllReduce_T/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hR:E611E:_62S:I Mnote: Pexpanded from macro 'DEFINE_ncclDevFunc'L E_SumPostDi v611_ | u 8 _2 , RnucncWloFruknBcaAtlclhRi,v ,a lugion,t 8p_rto,t oN,C CuLn_rAoLlGlO>_(T)R.ErEu,n (N)C;C L\_ P R| O ^T O_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 670 | t i611d | ( t i d )R,u nnWtohrrkeBaadtsc(hnk,( tahlrgeoa,d Ipdrxo.txo),, ugnrrooulpl(>g(r)o.urpu)n,( ) ;| ^~~~~~~~~~~~~~~~~\ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hfield 'group' will be initialized after field 'stepSize': 670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t i670d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~u p), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp173:: 17/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:1::670 :note: 15in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here: warning: initializer order does not match the declaration order [-Wreorder-ctor] 17 | DEFINE_ncclDev F670u | n c ( A ltliRde(dtuicde)_,T RnEtEh_rSeIaMdPsL(En_tShurmePaodsst)D,i vt_iud8I_n4B,l oncckc(ltFhurnecaAdlIldRxe.dxu)c,e ,g rFouunpc(SgurmoPuops)t,D i v| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ u i| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_t 8_t, NCCL_ALGO_T R671E | E , N CsCtLe_pPSRiOzTeO(_sStIeMpPSLiEz,e _4 )= = | 0^ ? ncclShmem.comm.bu/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hf:f611S:i62z:e snote: [expanded from macro 'DEFINE_ncclDevFunc'N CCL_PROTO_SI M611P | L E ] / NRCuCnLW_oSrTkEBPaSt/cshi), {a l g| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, p| r group(groupo to, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here: 670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 254 | 670 | P r i mtiitdi(vteisd<)T,, nRtehdrOepa,d sF(anntAhsryemamdest)r,i ct,, g/r*oDuipr(egcrto=u*p/)0,, P| r ^~~~~~~~~~~~~~~~~o to, 0> /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hp:r670i:m60s: note: | field 'group' will be initialized after field 'stepSize' ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DE/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSh/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h18 warnings generated when compiling for host. :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hdId:x670.:15x:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here e, 432F | if (tid u< subtn) RunWorncSumPoskColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ epSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 1818 warnings generated when compiling for gfx908. warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 22 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp 22 warnings generated when compiling for gfx90a. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1In file included from : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:In file included from 1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from :21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = r/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ecvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SI uint32_t y, head, mantissa; | ^ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ _t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 19 | cons t i nt ubid = ncclShmem.channelId - work->channelLo; | ^~~ int64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), ti d IbnaBrlroicekr(_tbhyr_egardoIudpx(.)x;) , | g ^~~~~~~~~~~~~~~~~~r oup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h60::29 :note: 15field 'group' will be initialized after field 'stepSize': note: expanded from macro 'barrier_by_group' 670 | t29i | d ( t i dc)o,n sntt hirneta dws (=n tthhrreeaaddsI)d,x .txi/dWIAnRBPl_oScIkZ(Et;h r\e a d| I ^d x.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from 1/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h ti:d670(:t15i:d )warning: , initializer order does not match the declaration order [-Wreorder-ctor]n threads(nthreads), tidInBlock(thread I670d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~( nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ffset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tiadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:111:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 111 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Broadcast_RING_LL128_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthred(tid), nthreads(ands(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670: | tid(tid), nthr670eads(nthread:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]s), ti dInBloc k(threadIdx.x670 | tid(ti),d group(g)roup),, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671n | stepStize(stepShize_ == 0 r? ncclShmeem.comm.buaffSizes[NdCCL_PROTOs_SIMPLE]/N(CCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArgprev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 22 warnings generated when compiling for gfx90a. 1111 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp In file included from 12 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. 12 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: 12 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 11 warning generated when compiling for gfx1201. warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1101. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp 13 warnings generated when compiling for gfx90a. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1:: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:note: warning: unused variable 'ptr' [-Wunused-variable] expanded from macro 'barrier_by_group'271 | uint64_t* ptr = recvPtr(029)+ll128O | ffset; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h :f29la:g151,: dnote: ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hexpanded from macro 'barrier_by_group': 145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | 29u | i n t 3 2c_otn sdta tian1t, wf l=a gt1h,r edaadtIad2x,. xf/lWaAgR2P;_ S I| Z ^~~~~ E; \/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h : 145| : ^28 : warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ RP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] :174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/W/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:ARP_SIZE; \ | ^ warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreaIn file included from ds(nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:s), wid(tid%WARP_19SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group : warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dataIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 18 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1102. 22 warnings generated when compiling for gfx90a. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(In file included from )/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp;: 1: In file included from | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h: ^~~~~~~~~~~~~~~~~~13 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h::529:: 15warning: : unused variable 'w' [-Wunused-variable]note: expanded from macro 'barrier_by_group' 29 | con s80t | i n t bwa r=r itehrr_ebayd_Igdrxo.uxp/(W)A;R P _| S ^~~~~~~~~~~~~~~~~~I ZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | | barrie ^ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uinIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cppea:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ dIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STE/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ PS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ .x), group(group), In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, FanAsymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx90a. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, In file included from flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_byIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group();In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:17315: warning: initializer order does not match the declaration order [-Wreorder-ctor] : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. In file included from 11 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx90a. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ inMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptrWARP_SIZE; \ | ^ = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:In file included from 1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; : | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 1111 warnings generated when compiling for gfx942. warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> p/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: 77/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(In file included from ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1expanded from macro 'barrier_by_group' 29 | const int w: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11In file included from warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 1111 warnings generated when compiling for gfx908. warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 1111 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp 11 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:131: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | st/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508epSize(ncclS:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ hmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80:5: warning: unused variable 'w' [-Wunused-variable] 80 | baIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from dId/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cppx:.1x: /In file included from WA/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hR:P13_: SIn file included from IZ/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hE:;175 : \ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h | : ^80 :5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCLIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: _STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitive271:19: warning: s, 1, Pr uint64oto, 0> pri_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ms | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ntIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tIn file included from idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1101. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h::12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:2977:18: warning: unused variable 'y' [-Wunused-variable] :77 | 15 : uint 32_t y, note: head, mantissa; | ^ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cppIn file included from :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthrIn file included from e/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ha:d670s:(60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads), wid(tid%W A670R | P _ S I ZtE)i,d (wtairdp)(,t indt/hWrAeRaPd_sS(InZtEh)r,e ad s| ) ~~~~~~~~~~~~~~~~~~, t| i stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)d InBlock(threadIdx.x), 507g | r o u p (wgarrpoIunpB)l,o c k| ( ^~~~~~~~~~~t hreadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ * ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROIn file included from TO_L/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cppL:1218: ]In file included from /N/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hC:C13L: _In file included from S/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hT:E173P: S//builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hsiz:e670o:f15(:u iwarning: ntinitializer order does not match the declaration order [-Wreorder-ctor]6 4_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h(:t199h:r57e:a dnote: Idin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested herex .x), group( g199r | o u pP)r,i m i| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i v e| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_< T, RedOp, FanAsy m671m | e t r i csi,z e1(,s tPerpoStioz,e _0 >= =p r0i m?s n c| c ^l Shmem.comm.buffSizes[NCCL_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cppP:R3O:T1O:_ Snote: Iin instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested hereM PLE]/NCCL_ S3T | EMPSSC/CsLi_zIeMoPfL(_TK)E R:N EsLt_eEpNSTiRzYe__F)U N{C _ D| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V R E| D group(groupO P_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 384 | ms c199c | l R uPnrIinmtietripvreest>,, 1P,r oPtrooLtLo1,2 80,> fpurlilmOsp s >| ( ^c omm, algo, work)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp;: 3\: 1 :| ^note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h : barrie175r_by_gr: oup(); /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29::15: note: expanded from macro 'barrier_by_group' 8029 | const int w = threadIdx.x/WARP_SIZE; \ : | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, fIn file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthalse); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 1111 warnings generated when compiling for gfx908. warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1101. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1030. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, fIn file included from lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | PrimitivIn file included from es, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const intIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, maIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ntissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cppIn file included from :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThrIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ead((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] Ops 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ >(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group Idx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp ^ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h::377::118:: note: warning: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here unused variable 'y' [-Wunused-variable] 3 | MSCCL_IMPL_KERNEL_E N77T | R Y _ F U N C _ DuEiVnRtE3D2O_Pt_ TyY,P Eh(ePardo,d ,m arnctcils_sfal;o a t| 8 ^, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from 80 | bar/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:r1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:i13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:e7: warning: unused variable 'w' [-Wunused-variable] r75 | b_arriber_by_group();y | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h_:29:15g: note: expanded from macro 'barrier_by_group' r29 | o const int w = tuhreadIpdx.x/WARP(_SIZE; \ ) | ^ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrieIn file included from r/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grouIn file included from p), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 In file included from 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiz); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h::13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:27175:7::19: warning: unused variable 'ptr' [-Wunused-variable] 271 | warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here note: expanded from macro 'barrier_by_group' 29 | const int w = thread 199 | PIdx.x/WARP_SIZE; \ | ^ rimitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cppby_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h::29:15: note: expanded from macro 'barrier_by_group' 29 | 1 const int w = : threadIdx.x/WARP_SIIn file included from ZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groupIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tireads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:In file included from 18: warning: unused variable 'y' [-Wunused-variable] 77 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp : 1 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hu:i12n: tIn file included from 3/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h2:_14t: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.hy:,77 :h18e:a dwarning: ,unused variable 'y' [-Wunused-variable] mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h :670173 | : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h : 670t:i15d:( twarning: idinitializer order does not match the declaration order [-Wreorder-ctor]) , nthreads(nthreads), tidInBlock(threa d670I | dx . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~( nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp 80 | : 1 : In file included from b/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.ha:r13r: iIn file included from e/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hr:_174b: y_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hg:r75o:u7p:( )warning: ; unused variable 'w' [-Wunused-variable] | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h :7529 | : 15 : note: expanded from macro 'barrier_by_group' barrier_by_ g29r | o u p ( )c;o n s| t ^~~~~~~~~~~~~~~~~~ int w = thr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.he:a29d:I15d:x .note: xexpanded from macro 'barrier_by_group'/ WARP_SIZE; \29 | | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h const int :173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable]w = threa dIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_tIn file included from data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work )506; | \ | t ^i d(tid), nthreads(nthre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ha:d670s:)15,: wnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid%WARP_SIZE) ,670 | w a r p (ttiidd(/tWiAdR)P_,S InZEt)h,r e a| d ~~~~~~~~~~~~~~~~~~s ( n| t stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)h reads), tid I507n | B l o c kw(atrhprIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr/oWuApR)P, _ S| I ^~~~~~~~~~~~~~~~~Z E), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h : 670| : warp(tid/WARP_SIZE60 : note: field 'group' will be initialized after field 'stepSize' 508 | f670l | a g T h rteiadd((t(itdi)d,%4 )n=t=h3r)e,a dgsr(onutph(rgeraoudps)),, t| i ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~d I n| B warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3l ock(threadIdx .509x | ) , g rsotuepp(Sgirzoeu(pn)c,c l S| h ^~~~~~~~~~~m em.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1:: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threa/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from dIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from dId/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cppx.:x1/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ype, Func##devredop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), wIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from arp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadId ^ x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t d/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cppata1, fla:1g: In file included from 1/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:,13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175d: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.ha:80t:5a: 2warning: unused variable 'w' [-Wunused-variable], fl80a | g 2 b;a | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hr:ri145er:_b35y_:gr owarning: up(unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, fZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:In file included from 1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h: 80271: | 5 : warning: unused variable 'w' [-Wunused-variable] uint64_t* ptr = recvP t80r | ( 0 ) + lbla1r2r8iOeffrs_etb;y _ g| r ^~~o up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h(tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ dIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRu/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ nInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 11 warnings generated when compiling for gfx90a. stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15:In file included from note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cppexpanded from macro 'barrier_by_group': 1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h: 17429: | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h : 75c:o7n:s twarning: iunused variable 'w' [-Wunused-variable]n t w = threadIdx.x/ W75A | R P _ S I Z Eb;a r\r i e| r ^_ by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx906. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp 11 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_g145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlocZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ k(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: :145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flawarning: g1, data2, flag2; | ^~~~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp2, flag2; | ^~~~~ :1: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from In file included from :174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreadsste)pSize_ == 0 ,? ncclShmem.co mm.buffSizesw[NCCL_PROTO_iSIMPLE]/NCCLd_STEPS/sizeof((T) : stepSizte_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ i| group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hd:199:%WA57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereR 199 | PPrimitives, 1, PrZoto, 0> primEs | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp):3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here, 3 | MSCC L_IMPL_KERNEwL_ENTRY_FUNC_DEaVREDOP_TYPE(Surm, rccl_floapt8, false);( | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387t:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | msciclRunInterpreter,W ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here hreadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 508 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cons/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] t int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] VREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384 :3: note: 670 | tiexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd(tid), n thre 384 | mscads(ntchreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ lRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tidIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | (tid), nthread tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_s 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from In file included from | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ eadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from b ar/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp29 | :1r: In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ier_by_group(); | In file included from ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 11 warnings generated when compiling for gfx1200. ; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:In file included from 29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1 warnings generated when compiling for gfx90a. , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29In file included from :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from 145/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h | : 13 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hu:i175n: t3/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h2_:t80 :d5a:t awarning: 1unused variable 'w' [-Wunused-variable], flag1, data2, flag2; | ^~~~~80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uintIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: In file included from warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp1, flag1, :data2,1 f: lag2; In file included from | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:145:21: :warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from :13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrierIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> pri11 warnings generated when compiling for gfx908. ms | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 11 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp 11 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grIn file included from oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const in/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll1In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 28Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0In file included from )+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128In file included from Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;In file included from | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp: ^~~~~2 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h11:: 145In file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:17521:: warning: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hunused variable 'flag1' [-Wunused-variable] :271:19: 145warning: | unused variable 'ptr' [-Wunused-variable] uint32_t data1, 271 | fl a g 1 , d at a u2,i nftl6a4g_2t; * | ^~~~~p tr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h :145=:28 :r ewarning: cunused variable 'data2' [-Wunused-variable]v Ptr( 0145) | + l l 1 2u8iOntf3f2s_te dtat;a1 , | f ^~~l ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from w = threadIdx.x/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2/: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from W/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable]A 75 | R barriPer_by_group_(); | ^~~~~~~~~~~~~~~~~~ S/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: Inote: expanded from macro 'barrier_by_group' 29 | Z const iEnt w = threadIdx.x/WARP_SIZE; \ | ^ ; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h670 | : 11 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.ht:id173(: ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hd):, 670n:t15h:r ewarning: adinitializer order does not match the declaration order [-Wreorder-ctor]s( nthreads), tidInBlock(threadIdx.x), gro u670p | (g r ou p )t,i d| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i| d) tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ , nthrea 671 | d s ( n tshtreepSaidzes(),st etpiSdiIzneB_l o=c= k0( t? hnrcecaldIdSxh.mx)e,m .gcroomump.b(ugfrfoSuizpes)[,N C CL| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~P RO T| O_ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_S IMPLE]/NCCL_STE P671S | / s i z eostfe(pTS) i: zes(tesptSepiSzie_z)e _{ =| = ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 0 | group(group ? ncclShmem.comm.buffSizes[NCC/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hL:_33P:R7O:T Onote: _in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here SIMPLE]/NCCL_STE P33S | / s i z e o fp(rTi)m s:( tsitde,p Snitzher_e)a d{s , | & ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r i n| g group(group- >prev, &ring->next, work->sendbuff, work->recvbuff, wo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hr:k33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rc-cl_bfloat8, NCC>Lr_eALdGOOp_ARrIgN,G ,0 ,NC CwLo_rPRkO-TO>_cSIoMnPnLIEn,d e4x), w| o^r k->connIndex)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h;: 611: 62| ^: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h,: a63l:g5o:, note: prin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested hereo to, unroll>().run (63) | ; \ r| u ^n Ring(tid ,670 | n th r e atdisd,( twiodr)k,) ;n t h| r ^e ads(nthreads), tidInBlock/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h(:t432h:r78e:a dnote: Iin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hered x.x), group(gro u432p | ) , | ^~~~~~~~~~~~~~~~~ if (ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hd: 670<: 60s:u bnote: tfield 'group' will be initialized after field 'stepSize'n ) RunWorkC o670l | l < F n ,t iTd,( tRiedd)O,p , nAtlhgroe,a dPsr(notthor,e aCdOsL)L,_ UtNiRdOILnLB>l(oc)k.(rtuhnr(etaiddI,d xs.uxb)t,n ,g rwoourpk()g;r o u| p ^) , | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ s, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | In file included from ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thread warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x//builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | 11 warnings generated when compiling for host. RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated11 when compiling for gfx942. warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 1111 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:x)15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n, group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work-In file included from >recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | ui/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_byn_tg3r2o_utp (d)a;t a 1| , ^~~~~~~~~~~~~~~~~~ flag1, data2, flag2/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h;: 29 :| 15 ^~~~~: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: data2, flaIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 80 | barrier_by uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nth re670a | ds ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~t hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 11 warnings generated when compiling for host. tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, woIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->serk->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ndbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for host. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp 11 warnings generated when compiling for gfx942. 1111 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_In file included from SIZ/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cppE:; 2\: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h| : ^ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_g/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 6710 | ? ncc l S h msetme.pcSoimzme.(bsutfefpSSiizzees_[ N=C=C L0_ P?R OnTcOc_lSSIhMmPeLmE.]c/oNmCmC.Lb_uSfTfESPiSz/essi[zNeCoCfL(_TP)R O:T Os_tSeIpMSPiLzEe]_/)N C{C L _| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T E P| S group(group/ sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hd,: 33n:t7h:r enote: adin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested heres , &ring->prev, &33r | i n g - > n epxrti,m sw(otrikd-,> snetnhdrbeuafdfs,, w&orrikn-g>-r>epcrvebvu,f f&,r iwnogr-k>-n>erxetd,O pwAorrgk,- >0s,e nwdobrukf-f>,c ownonrIkn-d>erxe,c vwbourfkf-, wo>rcko-n>nrIenddOepxA)r;g , | 0 ^, work->connIndex/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h,: 63w:o5r:k -note: >in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested herec onnIndex); | 63 ^ | runRing, ProtoSimple<1, 1, 4>, 4>' requested here, Proto, COLL _63U | N R O L Lr>u(ntRiidn,g (tid, nthrea/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hd:s432,: 78w:o rnote: kin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here) ; | ^ 432 | if (t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hi:d432 :<78 :s unote: bin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested heret n) RunWorkColl <432F | n , T , Riefd O(pt,i d A (T),. rRuend(Otpi,d ,A lsguob,t nP,r owtoor,k )C;O L L| _ ^U NROLL>().run(tid,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp :s7u:b1t:n ,note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herew ork); | ^ 7 | DEFIN/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cppE:_12n:c1c:l Dnote: ein instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested herev Func(Reduce_RING _12S | IDMEPFLIEN_EM_inncMcalxD_euv8F_u2n,c (nRcecdluFcuen_cRRIeNdGu_cSeI,M PFLuEn_cMMiinnMMaaxx_,u 8u_i4n,t 8n_ctc,l FNuCnCcLR_eAdLuGcOe_,R IFNuGn,c MNiCnCMLa_xP,R OuTiOn_tS8I_MtP,L EN,C C2L)_ A L| G^O _RING, NCCL_PROT/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hO:_611S:I62M:P Lnote: Eexpanded from macro 'DEFINE_ncclDevFunc', 4) | ^ 611 | Ru/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hn:W611o:62r:k Bnote: aexpanded from macro 'DEFINE_ncclDevFunc't chr,k Baaltgcoh,< cporlolt,o ,t yu,n rroeldlo>p(<)t.yr>u,n (a)l;g o\, p| r ^o to, unroll>().run();/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :\670 : 15| : ^ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h : 670 : 15t:i note: dfield 'nthreads' will be initialized after field 'tidInBlock'( tid), nthreads (670n | t h r e atdisd)(,t itdi)d,I nnBtlhorceka(dtsh(rnetahdrIedaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~~~~~~~I dx.x),/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :g670r:o60u:p (note: gfield 'group' will be initialized after field 'stepSize'r oup), | ^~~~~~~~~~~~~~~~~ 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h : 670 : 60t:i dnote: (field 'group' will be initialized after field 'stepSize't id), nthre a670d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodsc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhroeuapd)I,d x .| x ^~~~~~~~~~~) , group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] Idx.x), group(gr o670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthrup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 671 | ste/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hp:S670i:z15e:( snote: tfield 'nthreads' will be initialized after field 'tidInBlock'e pSize_ == 0 670? | n c c ltSihdm(etmi.dc)o,m mn.tbhurfefaSdisz(enst[hNrCeCaLd_sP)R,O TtOi_dSIInMBPlLoEc]k/(NtChCrLe_aSdTIEdPxS./xs)i,z egorfo(uTp) (:g rsotuepp)S,i z e| _ ^~~~~~~~~~~~~~~~~) { /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~670 : 60| : group(group note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hh:re33a:d7I:d x.note: x)in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here, group(group), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const: note: expanded from macro 'barrier_by_group' i29 | connst int w = tthreadIdx .x/WARP_SIwZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]11 75 | : In file included from bar/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hrier_by_g:roup(); 174| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:: 29:15: note: expanded from macro 'barrier_by_group'/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h 29 | :const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp 145 | uint32:_t data1, 2flag1, d: ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work-In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Run/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, sWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ubtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cppg:2r: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.ho:11: uIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175p: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h(:80:5): warning: ;unused variable 'w' [-Wunused-variable] 80 | | ^~~~~~~~~~~~~~~~~~ ba r/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hrier_b:y_gro29up:(); 15 | : ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h note: :29:expanded from macro 'barrier_by_group'15 : note: expanded from macro 'barrier_by_group' 29 | 29 cons | t int w = thr const int w = threadIdx.x/WARP_SIZE; \ | ^ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hCL:_670P:R15O:T Owarning: _Sinitializer order does not match the declaration order [-Wreorder-ctor]I MPLE]/NCCL_STEPS/sizeof(T) : stepSi z670e | _) { | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d (| t group(groupi d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 33 :| 7 tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 671 | s t33e | p S i z e ( sptreipmSsi(ztei_d ,= =n t0h r?e andcsc,l S&hrmienmg.-c>opmrme.vb,u f&frSiinzge-s>[nNeCxCtL,_ PwRoOrTkO-_>SsIeMnPdLbEu]f/fN,C CwLo_rSkT-E>PrSe/csvibzuefoff,( Tw)o r:k -s>treepdSOipzAer_g), {0 , | w ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o r k| - group(group> connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here: 63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 6333 | | r u npRriinmgs<(Tt,i dR,e dnOtph,r ePardost,o ,& rCiOnLgL-_>UpNrReOvL,L >&(rtiindg,- >nntehxrte,a dwso,r kw-o>rske)n;d b u| f ^f , work->recvbuff, /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hw:o432r:k78-:> rnote: ein instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hered OpArg, 0, work -432> | c o n n I n diefx ,( twiodr k<- >scuobntnnI)n dReuxn)W;o r k| C ^o ll, ProtoSimple<1, 1, 4>, 4>' requested herel go, Proto, 63C | O L L _ UrNuRnORLiLn>g(<)T.,r uRne(dtOipd,, Psruobtton,, CwOoLrLk_)U;N R O| L ^L >(tid, nthreads, /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cppw:o12r:k1):; note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here| ^ 12 | DEFI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hN:E432_:n78c:c lnote: Din instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested heree vFunc(Reduce_R I432N | G _ S I M P LiEf_ P(rteiMdu lR(I)N.Gr,u nN(CtCiLd_,P RsOuTbOt_nS,I MwPoLrEk,) ;4 ) | ^| ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp611::1262::1 :note: expanded from macro 'DEFINE_ncclDevFunc'note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 611 | RunWor k12B | aDtEcFhIc,e _aRlIgNoG,_ SpIrMoPtLoE,_ PurnerMoulllS>u(m)_.fr3u2n_(4),; n\c c l| F ^u ncReduce, FuncPre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hM:u670l:S15u:m ,note: field 'nthreads' will be initialized after field 'tidInBlock'f loat, NCCL_A L670G | O _ R I NtGi,d (NtCiCdL)_,P RnOtThOr_eSaIdMsP(LnEt,h r4e)a d s| ^) , tidInBlock(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.ht:h611r:e62a:d Inote: dexpanded from macro 'DEFINE_ncclDevFunc'x .x), group (611g | r o u p )R,u n W| o ^~~~~~~~~~~~~~~~~r kBatch, a670l | g o , ptriodt(ot,i du)n,r onltlh>r(e)a.drsu(nn(t)h;r e\a d s| ) ^, tidInBlock/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h(:t670h:r15e:a dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'd x.x), group( g670r | o u p ) ,t i d| ( ^~~~~~~~~~~t id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nt/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | ba:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threarrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ dIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OffIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ set; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMP11 warnings generated when compiling for host. LE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ +ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtrunused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = r(0)+ll128Offset; | ^~~ ecvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncR | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h::17433:: 7/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h: :note: 75in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here: 7: warning: unused variable 'w' [-Wunused-variable] 33 | 75 | pr i m s ( tbiadr,r inetrh_rbeya_dgsr,o u&pr(i)n;g - >| p ^~~~~~~~~~~~~~~~~~r ev, &ring->n/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.he:x29t:,15 :w onote: rexpanded from macro 'barrier_by_group'k ->sendbuff, w29o | r k - > rceocnvsbtuf fi,n tw owr k=- >trherdeOapdAIrdgx,. x0/,W AwRoPr_kS-I>ZcEo;n n\I n d| e ^x , work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ educe, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp :b2a: rIn file included from r/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hi:e11r: _In file included from by/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h_:g174r: o/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hup:(145):;14 : | warning: ^~~~~~~~~~~~~~~~~~unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: 145note: | expanded from macro 'barrier_by_group' uint32 _29t | d a t ac1o,n sftl aign1t, wd a=t at2h,r efaldaIgd2;x. x| /W ^~~~~ ARP_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hS:I145:Z21E:; warning: \ unused variable 'flag1' [-Wunused-variable] | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, wIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]ork); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreaIn file included from ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const 33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadId145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbufIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:f2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreaIn file included from , work->reds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+warning: ll128Offset; | ^~~ unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1100. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:In file included from 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp In file included from :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1101. 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreadsIn file included from ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, wor/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp611::262: :In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hnote: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1101. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2; | ^~~~~ ; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFu:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]ncReduce , FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from P_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | : const i29nt w = t:hreadIdx.x/W15ARP_SIZ:E; \ | ^ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: RP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :7 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cppoup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~:2: | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBl.ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from eadIdx.x/WARP_SIZE/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2;: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h :174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h\ | ^ :145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hPR:OT11O: _SIn file included from I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hM:P173L: E]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/N:C670C:L15_:S TEwarning: PSinitializer order does not match the declaration order [-Wreorder-ctor]/ sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hi:d33I:n7B:l onote: ckin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here( threadIdx.x), group(g r33o | u p ) , | p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r i m| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_( tid, nthreads, &r i671n | g - > p rsetve,p S&irzien(gs-t>enpeSxitz,e _w o=r=k -0> s?e nndcbculfSfh,m ewmo.rcko-m>mr.ebcuvfbfuSfifz,e sw[oNrCkC-L>_rPeRdOOTpOA_rSgI,M P0L,E ]w/oNrCkC-L>_cSoTnEnPISn/dseixz,e owfo(rTk)- >:c osntneIpnSdiezxe)_;) {| ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.ht:i33d:,7 :n tnote: hrin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested heree ads, work); | ^ 33 | prims(ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hd:,432 :n78t:h rnote: ein instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herea ds, &ring->prev, &432r | i n g - > n eixft ,( twiodr k<- >ssuebntdnb)u fRfu,n WwoorrkkC-o>lrle rAeldgOop,A rPgr,o t0o,, wCoOrLkL-_>UcNoRnOnLILn>d(e)x.,r uwno(rtki-d>,c osnunbItnnd,e xw)o;r k )| ; ^ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp63::75::1 :note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested herenote: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 63 | runR i7n | gD_(StIiMdP,L En_tPhrroeda_dfs6,4 _w2o,r kn)c;c l F| u ^n cReduce, FuncProd, doub/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hl:e432,: 78N:C Cnote: Lin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here_ ALGO_RING, NCCL_PROT O432_ | S I M P L E ,i f2 )( t i| d^ < subtn/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hk(threadIdx.x), group(grou:p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBl: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreaock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == d0s ? ncclShmem.comm.buffS,i z&ersi[nNgC-C>Lpr_ePvR,O T&Or_iSnIgM-P>LnEe]x/tN,C CwLo_rSkT-E>PsSe/nsdibzuefoff,( Tw)o r:k -s>treepcSvibzuef_f), {w o r| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- > r| e group(groupd OpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 33 | 63 | rpurniRmisn(gtLp_rUeNvR,O L&Lr>i(ntgi-d>,n enxtth,r ewaodsr,k -w>osrekn)d;b u f| ^f , work->recvbuff, work->redOpAr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hg:,432 :078,: wnote: oin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herer k->connIndex, work->c o432n | n I n d e x )i;f (| t ^i d < subtn) RunWork/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hC:o63l:l5<:F nnote: ,in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here T, RedOp, Algo, 63P | r o t o ,r uCnORLiLn_gU(d)O.pr,u nP(rtoitdo,, sCuObLtLn_,U wNoRrOkL)L;> ( t| i ^d , nthreads, work);/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp :| 12 ^: 1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_nc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hc:l432D:e78v:F unote: nin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herec (Reduce_RING_SIMPL E_432P | r o d _ f 6 4i_f4 ,( tnicdc lF< usnucbRtend)u cReu,n WFournkcCPorlold<,F nd,o uTb,l eR,e dNOCpC,L _AAlLgGoO,_ RPIrNoGt,o ,N CCCOLL_LP_RUONTROO_LSLI>M(P)L.Er,u n4()t i d| ,^ subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12 :6111 | : note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here RunWorkBatche,v Faulngco(,R epdrucoet_oR,I NuGn_rSoIlMlP>L(E)_.Prruond(_f)6;4 _\4 , | n ^c clFuncReduce, FuncProd, double, NCC/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hL:_670A:L15G:O _note: Rfield 'nthreads' will be initialized after field 'tidInBlock'I NG, NCCL_PROTO_SIMP L670E | , 4 ) t| i^d (tid), nthreads(nthr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.he:a611d:s62):, note: texpanded from macro 'DEFINE_ncclDevFunc'i dInBlock(thre a611d | I d x .Rxu)n,W ogrrkoBuapt(cghr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h,: 670a:l60g:o ,note: pfield 'group' will be initialized after field 'stepSize'r oto, unroll>( )670. | r u n ( )t;i d\( t i| d ^) , nthreads(nthreads), t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hi:d670I:n15B:l onote: ckfield 'nthreads' will be initialized after field 'tidInBlock'( threadIdx.x), grou p670( | g r o u pt)i,d ( t| i ^~~~~~~~~~~d ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:711 warnings generated when compiling for host. : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1100. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp 12 warnings generated when compiling for gfx90a. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ E; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdIn file included from x/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ .: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const in| ^~~~~ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:In file included from 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptrPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)In file included from +ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work);In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFROTO_SIMPuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' LE]/N 611 | RunWCorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nCL_STEPS/sizethreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'o 670 | tf(T) id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex,In file included from work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->r/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:e508:29: dwarning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group O/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ pArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h warnings generated when compiling for host. :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SnBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PRO/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiz/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uinIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp 12 warnings generated when compiling for gfx90a. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barri/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' data2, fl 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARPIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _SIZE; \ | ^ | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h :14511 | : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h : 173u: i/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hnt:3752:_7t: dwarning: atunused variable 'w' [-Wunused-variable]a 1, flag1, data2, fl a75g | 2; | ^~~~~ barrier/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h_:b145y:_21g:r owarning: uunused variable 'flag1' [-Wunused-variable]p (); | ^~~~~~~~~~~~~~~~~~ 145 | uin/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.ht:3292:_15t: dnote: aexpanded from macro 'barrier_by_group't a1, flag1, 29d | a t a 2 ,c ofnlsatg 2i;n t | w ^~~~~ = th/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hr:e145a:d28I:d xwarning: .unused variable 'data2' [-Wunused-variable]x /WARP_SI Z145E | ; \ u| i ^n t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5ta1, flag1, data2, flag2; | ^~~~~ : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from In file included from .x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+l/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | ul128Offset; | ^~~ int64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 11 warnings generated when compiling for host. 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, woIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. rk->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, wo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 75145 | u | int32_t data1, flag1 , data2, flag 2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hb:145:21a: warning: unused variable 'flag1' [-Wunused-variable] rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:In file included from warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevIn file included from Func(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, maIn file included from ntissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp::2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h: 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hnote: :80:5: warning: unused variable 'w' [-Wunused-variable] expanded from macro 'barrier_by_group' 80 | bar rier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:1529: note: expanded from macro 'barrier_by_group' | 29 | con st int w = thre adIdx.x/WARP_S IZE; \ | ^ In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:| ^~~~~~~~~~~~~~~~~~2 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h: :In file included from 29/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h::15:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hnote: expanded from macro 'barrier_by_group': 80:5: warning: 29unused variable 'w' [-Wunused-variable] | cons 80t | i nt b war r=i etrhr_ebyad_gIrdxo.uxp/(W)A; R P| _ ^~~~~~~~~~~~~~~~~~SI ZE; /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h\ : 29:| 15 ^: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from 271 | uint6/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp4:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->conn/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Index); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 11 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBa/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670t:ch<15coll:, ty , rewarning: dop, a lgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29lag2;:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thread Idx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t d/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Ptr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:In file included from 11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncR/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.heduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | In file included from tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp| ^~~~~~~~~~~~~~~~~~: 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h::27129::1519:: note: warning: expanded from macro 'barrier_by_group' unused variable 'ptr' [-Wunused-variable] 29 | const int w = threadIdx .271x | / W A R P _ S I ZuEi;n t\6 4 _| t ^* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp29:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from 80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1100. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WIn file included from A/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ RP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OffsetIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.he:adIdx.x), 670group(gro:up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hnote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here: 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flaIn file included from | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ g2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; In file included from \ | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp ^: 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ epSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_In file included from t d/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cppa:t2a: 1In file included from ,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h :f11l: aIn file included from g/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h1:, 174d: at/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.ha:275,: 7f:l awarning: g2unused variable 'w' [-Wunused-variable]; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp barr:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tiIn file included from | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:d), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx942. In file included from 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | 80b | ar r i e rb_abryr_igerro_ubpy(_)g;r o u| p ^~~~~~~~~~~~~~~~~~( ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: 29note: | expanded from macro 'barrier_by_group' const int w29 | = t h rceoandsItd xi.nxt/ WwA R=P _tShIrZeEa;d I\d x .| x ^/ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: In file included from note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. epSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hgroup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:75:7: warning: unused variable 'w' [-Wunused-variable] :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:7515: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x | /WARP_SIZE; \ | ^ In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | coIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 12 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARPIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from 670 | In file included from tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.x.x), groucpomm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthre, nthreads(ntads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h11:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nth warnings generated when compiling for gfx1201. reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatterIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiz,e_) { FuncPreMulSum| , u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreadps, &ring-,> algo, proto, prevunroll>().run(); \ | ^ , &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h: 65670: | 5 : note: tin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested herei d(tid), nthread s65( | n t h r eraudnsR)i,n gt((gtriodu,p )n,t h r| e ^~~~~~~~~~~~~~~~~a ds, work)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h;: 670 :| 60 ^: note: field 'group' will be initialized after field 'stepSize' 670 | tid(ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hd):,432 :n78t:h rnote: ein instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herea ds(nthreads), tid I432n | B l o c k ( tihfr e(adtIiddx .().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | con: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dataIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(ti/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bflo11 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ at16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nth11 warnings generated when compiling for gfx908. reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group()In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: :initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =670= :0 15?: ncwarning: clSinitializer order does not match the declaration order [-Wreorder-ctor]hm em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof (T670) | : s tteipdSi(ztei_d)) ,{ n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 671 | stepSize(stepS i34z | e _ = = 0p r?i mnsc(ctliSdh,m enm.tchormema.dbsu,f f&Sriiznegs-[>NpCrCevL,_ P&RrOiTnOg_-S>InMePxLtE,] /woNrCkC-L>_sSeTnEdPbSu/fsfi,z ewoofr(kT-)> r:e csvtbeupfSfi,z ewo_r)k -{> r e| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O p A| r group(groupg , 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hnote: :in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here34 :7: 65 | runRinnote: g, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested hereT, RedOp, Proto, COLL _34 | U N R O L Lpr>i(mtsi(tdi,d ,n ntthhrreeaaddss,, &wroirnkg-)>;p r e| v ^, &ring->next, work->sendbuff, w/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.ho:r432k:-78>:r enote: cvin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereb uff, work->redOpArg , 4320 | , w o r k - >icfo n(ntIindd e)c oRnunnInWdoerxk)C;o l l| < ^F n, T, RedOp, Algo,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h :P65r:o5t: onote: ,in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here COLL_UNROLL> (65) | . r u nr(utniRdi,n gs(tid, nt/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpph:r12e:a1d:s ,note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested herew ork); | ^ 12 | DEFINE_ncc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hl:D432e:v78F:u nnote: cin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here( ReduceScatter_RIN G432_ | S I M P LiEf_ (Ptriodd _L(G)O._rRuInN(Gt,i dN,C CsLu_bPtRnO,T Ow_oSrIkM)P;L E ,| ^4 ) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h::12611::162:: note: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereexpanded from macro 'DEFINE_ncclDevFunc' 611 | R u12n | WoDrEkBFaItNcEh_c,a tatlegro_,R IpNrGo_tSoI,M PuLnEr_oPlrlo>(d)_.brfu8n_(4),; nc\c l F| u ^nc ReduceScatter, FuncProd,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :r670c:c15l:_ bnote: ffield 'nthreads' will be initialized after field 'tidInBlock'l oat8, NCCL_ALGO_ R670I | N G , NCtCiLd_(PtRiOdT)O_,S InMtPhLrEe,a d4s)( n t| h^r eads), tidInBlo/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hc:k(611t:h62r:e note: aexpanded from macro 'DEFINE_ncclDevFunc'd Idx.x), gr o611u | p ( g r oRuupn)W,o r k| B ^~~~~~~~~~~~~~~~~a tch, algo ,670 | p r ot o , tuindr(otlild>)(,) .nrtuhnr(e)ad;s (\n t h| r ^e ads), tidInBlock(threadId/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hx:.670x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from barrier_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cppb:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hy:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dataIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ t int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, w:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll1In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = th28Offset; | ^~~ readIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, id(tid), nthreads(nwthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ork->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRin g <| T group(group, RedOp, ProtoLL128, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:: 432note: :78in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | 34 | i f ( t i dp r,p rReevd,O p, Algo,& Prroitnog,- >CnOeLxLt_,U NwRoOrLkL->>(s)e.nrdubnu(ftfi,d ,w osrukb-t>nr,e cwvobrukff),; w o| r ^k ->redOpArg, 0, work->con/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cppn:I5n:d1e:x ,note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested herew ork->connIndex); | 5 ^ | DEFINE_ncclDevFu/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hn:c65(:R5:e dnote: uin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested herec eScatter_RI N65G | _ L L 1 2r8u_nPRriondg_ (Ftuindc,P rnotdh,r edaodusb,l ew,o rNkC)C;L _ A| L ^G O_RING, NCCL_PROTO_LL12/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h8:,432 :278): note: | in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here^ 432/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h | : 611 : 62 : note: iexpanded from macro 'DEFINE_ncclDevFunc'f (tid < sub t611n | ) R u nWRournkWCoorlklBt,o ,a lCgOoL,L _pUrNoROtLoL,> (u)n.rroulnl(>t(i)d.,r usnu(b)t;n ,\ w o| r ^k ); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DIn file included from EFI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cppNE:_2n: cIn file included from c/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hl:D11e: vIn file included from F/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hu:n173c: (Re/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hdu:c670e:S15c:a twarning: teinitializer order does not match the declaration order [-Wreorder-ctor]r _RING_SIMPLE_Prod_f64_2, ncclFuncReduc e670S | c a t t etri,d (Ftuindc)P,r ondt,h rdeoaudbsl(en,t hNrCeCaLd_sA)L,G Ot_iRdIINnGB,l oNcCkC(Lt_hPrReOaTdOI_dSxI.MxP)L,E ,g r2o)u p (| g^r oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 671 | s t611e | p S i z eR(usntWeoprSkiBzaet_c h=<=c o0l l?, ntcyc,l Srhemdeom.pc.,b uaflfgSoiz,e sp[rNoCtCoL,_ PuRnOrToOl_lS>I(M)P.LrEu]n/(N)C;C L\_ S T| E ^P S/sizeof(T) : s/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ht:e670p:S15i:z enote: _field 'nthreads' will be initialized after field 'tidInBlock') { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nthreads)/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h,: 34t:i7d:I nnote: Bin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested herel ock(threadIdx.x), 34g | r o u p ( gprroiumps)(,t i d| , ^~~~~~~~~~~~~~~~~ nthrea/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hd:s670,: 60&:r inote: nfield 'group' will be initialized after field 'stepSize'g ->prev, &rin g670- | > n e x tt,i dw(otrikd-)>,s enntdhbruefafd,s (wnotrhkr-e>ardesc)v,b utfifd,I nwBolrokc-k>(rtehdrOpeAardgI,d x0.,x )w,o rgkr-o>ucpo(ngnrIonudpe)x,, w| o ^~~~~~~~~~~r k->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OffsIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ et; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barriIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | bIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 12In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[11 warnings generated when compiling for host. NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp :2 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.| ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ?In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2 ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptrIn file included from = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h :3411 | : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h : 174 : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hpr:i145m:s14(:t iwarning: d,unused variable 'data1' [-Wunused-variable] n threads, &ring->prev, &ring->ne x145t | , w o ruki-n>ts3e2n_dtb udfaft,a 1w,o rfkl-a>gr1e,c vdbautfaf2,, wfolrakg-2>;r e d| O ^~~~~p Arg, 0, w/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.ho:r145k:-21>:c owarning: nunused variable 'flag1' [-Wunused-variable]n Index, wo r145k | - > c o nuniInntd3e2x_)t; d a| t ^a 1, flag1, data2, /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hf:l65a:g52:; note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here| ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]65 | ru n145R | i n g < Tu,i nRte3d2O_pt, dPartoat1o,, fClOaLgL1_,U NdRaOtLaL2>,( tfilda,g 2n;t h r| e ^~~~~a ds, /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hw:o145r:k35):; warning: | unused variable 'flag2' [-Wunused-variable] ^ 145 | uint32_t da/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.ht:a4321:,78 : fnote: lin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herea g1, data2, flag2 ;432 | | ^~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from In file included from :/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:_t data1, flag1, data2, flag2; | ^~~~~ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nt tidInhreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] if (t 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: inote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here < subtn) Ru n65W | o r k C orlulnN(RtOiLdL,> (n)t.hrruena(dtsi,d ,w osrukb)t;n , | w ^o rk); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp432: | 12 : 1 : note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested herei f (tid < subtn) RunWo r12k | CDoElFlIE(_)P.rroudn_(ut8i_d4,, snucbctlnF,u nwcoRrekd)u;c e S| c ^a tter, FuncProd, ui/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cppn:t78:_1t:, note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereN CCL_ALGO_RING, NC C7L | _DPERFOITNOE__SnIcMcPlLDEe,v F4u)n c (| R^e duceScatter_RING_SIMP/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hL:E611_:P62r:o dnote: _expanded from macro 'DEFINE_ncclDevFunc'u 8_2, ncclFun c611R | e d u c eRSucnaWtotrekrB,a tFcuhnL,_ AaLlGgOo_,R IpNrGo,t oN,C CuLn_rPoRlOlT>O(_)S.IrMuPnL(E),; 2\) | | ^^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'611 | RunWorkBa t670c | h < c o ltli,d (ttyi,d )r,e dnotphd,s (anltghor,e apdrso)t,o ,t iudnIrnoBlllo>c(k)(.trhurne(a)d;I d\x . x| ) ^, group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hfield 'nthreads' will be initialized after field 'tidInBlock': 670:60: note: field 'group' will be initialized after field 'stepSize' 670 | 670 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~, | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp 1111 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = reIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ cvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:In file included from warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11 : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:c174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75o:7: warning: nunused variable 'w' [-Wunused-variable] s75t | biarrnIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ier_bty_gIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ro wu p(=) ;threadIdx.x/WARP_SIZE; \ | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, w/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_R:670:15:ING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | R warning: initializer order does not match the declaration order [-Wreorder-ctor]unWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' t670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | baIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr In file included from = /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cppr:e2c: vIn file included from Pt/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hr(:011): +In file included from l/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hl:11752: 8/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hOf:f80s:e5t:; warning: | unused variable 'w' [-Wunused-variable] ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w In file included from = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreax), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from //builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.coIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 1111 warnings generated when compiling for gfx908. warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h11: warnings generated when compiling for gfx1200. 174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx908. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: In file included from warning: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing[ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_byIn file included from _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp 11 warnings generated when compiling for host. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, N/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_byIn file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ SIZE; \ | ^ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 611 | RunWorkBa t34c | h < c o l l ,p rtiym,s (rteiddo,p r,e aadlsg,o ,& rpirnogt-o>,pr euvn,r o&lrli>(n)g.-r>unne(x)t;, \w o r| k ^- >sendbuff, work->recvbuff, work->redOpA/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hrg:,670 :015,: wnote: ofield 'nthreads' will be initialized after field 'tidInBlock'r k->connIndex, work- >670c | o n n I ntdiedx()t;i d )| , ^ nthreads(nthreads), tid/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hI:n65B:l5o:c knote: (in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested heret hreadIdx.x) ,65 | g r o u pr(ugnrRoiunpg)<,T , | ^~~~~~~~~~~~~~~~~R edOp, P/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hr:o670t:o60,: Cnote: Ofield 'group' will be initialized after field 'stepSize'L L_UNROLL>(t i670d | , n t htrieda(dtsi,d )w,o rnkt)h;r e a| d ^s (nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hI:d432x:.78x:) ,note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereg roup(group), | 432 ^~~~~~~~~~~ | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bu/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 11 warnings generated when compiling for host. 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barriIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdIn file included from x./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 11 warnings generated when compiling for gfx1200. uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from hreadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_gro/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] : 145In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | ui | uint32n_t64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OffsIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SeIZE; \ | ^ t; | ^~~ :11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, protoC,O LuLnr_oUlNlR>O(L)L.r>u(nt(i)d; ,\ n t| h ^r eads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreap, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclF11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cppIn file included from :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | barrier_by_group(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp):2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h;:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75: | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint3275_t data1, flag1, | data2, flag 2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, fla g1, data2, flag2; | ^~~~~ b/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] a 145 | uinrt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ rier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAgroRPup(); | ^~~~~~~~~~~~~~~~~~ _/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1111 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->conn11 warnings generated when compiling for gfx1030. Index, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock11( warnings generated when compiling for gfx1100. threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5group:(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h :29:15warning: : note: expanded from macro 'barrier_by_group' unused variable 'w' [-Wunused-variable]29 | co nst int w = threa80dI | d barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] :11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,In file included from f/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cppl:a2g: 1In file included from ,/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h :d11a: tIn file included from a/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h2:,175 : f/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hl:a271g:219;: warning: | ^~~~~unused variable 'ptr' [-Wunused-variable] 271 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h : 145 : 21 : warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from 11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | c/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpponst int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ t int w = tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hreadIdx.x:/WARP_S174IZE; \ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | (tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from RP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:211: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h: :175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: In file included from warning: unused variable 'ptr' [-Wunused-variable] 271 | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h uint64:_t* ptr 175= recvPtr: (0)+ll128/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.hOffset; | ^~~ :80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:In file included from 15: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cppexpanded from macro 'barrier_by_group': 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h29: | 173 : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h c:o75n:s7t: iwarning: nunused variable 'w' [-Wunused-variable]t w = threadIdx.x/WARP_SIZ E75; | \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groupIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, d15ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3112_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx908. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h :173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreadsIn file included from , work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatt11 warnings generated when compiling for gfx1102. er_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 11 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0,In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->red work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ OpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(RIn file included from educe_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 12In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp5:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cppIn file included from :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: 145/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: | unused variable 'ptr' [-Wunused-variable] 271 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSizeeads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ _/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h11 warnings generated when compiling for host:670:. 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint11 warnings generated when compiling for gfx1030. 64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: 145 | uin note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp80 | : barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | In file included from ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreaIn file included from ds, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 1111 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, w 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here , 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x) 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->red:O670p:A15:r gwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] 0, work->connIndex, work->connIndex); | ^ 670 | tid(tid), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hn:t63h:r5e:a dnote: sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here( nthreads), 63t | i d I n BrluoncRki(ntgh ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_d , nthreads, wo r671k | ) ; | s ^t epSize(stepSize_ == 0/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h :?432 :n78c:c lnote: Sin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereh mem.comm.buff S432i | z e s [ N C CiLf_ P(RtOiTdO _().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp::337::71:: note: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herein instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | D33E | F I N E _ n cpcrliDmesv(Ftuindc,( Rnetdhurceea_dRsI,N G&_rSiInMgP-L>Ep_rSeuvm,_ f&6r4i_n2g,- >nncecxltF,u nwcoRrekd-u>csee,n dFbuunfcfS,u mw,o rdko-u>brleec,v bNuCfCfL,_ AwLoGrOk_-R>IrNeGd,O pNACrCgL,_ P0R,O TwOo_rSkI-M>PcLoEn,n I2n)d e x| ,^ work->connIndex/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h):;611 : 62| : ^ note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63: 5611: | note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here RunWorkBa t63c | h < c o lrlu,n Rtiyn,g ,, Parlogtoo,, pCrOoLtLo_,U NuRnOrLoLl>l(>t(i)d.,r unnt(h)r;e a\d s ,| ^w ork); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h::15432:: 78note: :field 'nthreads' will be initialized after field 'tidInBlock' note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 670 | 432 | t i d ( t iidf) ,( tnitdh r().run(/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ht:i670d:,60 :s unote: bfield 'group' will be initialized after field 'stepSize't n, work); 670| | ^ tid(tid), nthre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cppa:d12s:(1n:t hnote: rin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested heree ads), tidInBloc k12( | tDhErFeIaNdEI_dnxc.cxl)D,e vgFruonucp((Rgerdouucpe)_,R I N| G ^~~~~~~~~~~_ SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thre/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proadIdx.x), group(group), | ^~~~~~~~~~~ to, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _t data1, flag1, da/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h: :In file included from 29/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h::15174:: note: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hexpanded from macro 'barrier_by_group' :145:14: 29warning: | unused variable 'data1' [-Wunused-variable] const int w = thr e145a | d I d x. x/uinWtAR3P2__StI ZdEa; t\a 1| , ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flata2, flag2; | ^~~~~ g2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from ./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: ptr = recvPtr(0)+ll128Offset; | ^~~ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:In file included from 15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(th/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from :21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreIn file included from ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tideads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShme:m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h670 | :63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xIn file included from ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; In file included from | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ w = threadIdx.x/WARP_SIZE; \ | ^ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from = threadIdx.x/WARP_SIZE; \ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, datall128Offset; | ^~~ 2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for gfx942. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp 12 warnings generated when compiling for gfx90a. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from onst int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable]In file included from 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset;/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1101. In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, fla/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | ug1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from a2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cppIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flaIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ g2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uIn file included from int32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hidIn:670:15B: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^lock(threadIdx. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hx:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' ) 670 | t,id(tid), nth reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc11k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h 33 | prims(tid, nthread:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | s, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagon2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;st int w = threadIdx.x/WARP_SIZE; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, In file included from &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h2,: 670n:c15c:l Fnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n cReduce, FuncSumPostDiv, ui n670t | 3 2 _ t ,t iNdC(CtLi_dA)L,G On_tRhIrNeGa,d sN(CnCtLh_rPeRaOdTsO)_,S ItMiPdLIEn,B l2o)c k (| t^h readIdx.x), gr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.ho:u611p:(62g:r onote: uexpanded from macro 'DEFINE_ncclDevFunc'p ), | ^~~~~~~~~~~~~~~~~ 611 | RunWork/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hB:a670t:c60h:< cnote: ofield 'group' will be initialized after field 'stepSize'l l, ty, redop | , a l gtoi,d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:757: warning: unused variable 'w' [-Wunused-variable] 75 | | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15 : note: expanded from macro 'barrier_by_group' 29 | co nst int w = threbadIdx.x/WARPa_SIZE;r \ | ^ rier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSi 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxze(stepSize_ == 0 .x), group(? ncgroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)11 warnings generated when compiling for gfx1100. , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' roto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work);11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hPLE:]/670N:C15C:L _warning: STinitializer order does not match the declaration order [-Wreorder-ctor]E PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hx:.33x:)7,: gnote: roin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereu p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 33 | prims(t i671d | , n t h rsetaedpsS,i z&er(isntg-e>ppSreivz,e _& r=i=n g0- ?> nnecxcltS,h mweomr.kc-o>msme.nbdubfufSfifz,e sw[oNCrCkL-_>PrReOcTvOb_ufSfI,M PwLoEr]k/-N>CrCeLd_OSpTAErPgS,/ s0i,z ewoorfk(-T>)c o:n nsItnepdSeixz,e _w)o r{k - >| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c o n| n group(groupI ndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h :6333 | : 7 : note: rin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereu nRing((ttiidd,, nntthhrreeaaddss,, w&orrikn)g;- > p| r ^e v, &ring->next, w/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:o432r:k78-:> snote: ein instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested heren dbuff, work->rec v432b | u f f , w oirfk -(>triedd O

Ccoolnln cAolngnoI, nPdreoxt)o;, C| O ^L L_UNROLL>()./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.hr:u63n:(5t:i dnote: ,in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here subtn, wo r63k | ) ; | r ^u nRing, 1, 2, 2>::run' requested here, COLL_UNROLL>(ti d7 | ,D nEtFhIrNeaE_dns,c cwlorDke); vF u| n ^c (Reduc/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.he:_432:R78:I note: NGin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here_ SIMPLE _432S | u m P o sift D(itvi_du 8< _s2u,b tnncc) lFRuunnWcoRrekdCuoclel,O(_).RrIuNnG(t, iNdCC,L _suPbRtOnT,O _wSorIMk);P L| E ^, 2) /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp| :^12: 1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 12 | DEFI N611E | _ n c c lRDuenvWFournkcB(aRtecdhus,t Dailvg_ou,6 4p_r4o,t on,c culnFruonlclR>e(d)u.creu,n (F)u;n c\S u m| P ^o stDiv, uint64_/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.ht:,670 :N15C:C note: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ ALGO_RING, N670C | C L _ P RtOiTdO(_tSiIdM)P,L En,t hr4ea)d s (| n^t hreads), tidInB/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.hl:o611c:k62(:t hnote: rexpanded from macro 'DEFINE_ncclDevFunc'e adIdx.x), gr o611u | p (g r ou pR)u, n W| o ^~~~~~~~~~~~~~~~~ rkBat/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.hc:h670<:c60o:l note: lfield 'group' will be initialized after field 'stepSize', ty, redo p670< | t y > ,t iadl(tgido),, pnrtohtroe,a dusn(rntohlrle>a(d)s.)r,u nt(i)d;I n\B l o| ^epSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.com:33:m7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here. buffSiz es33 | pri[ms(tid, NCnthCreads, &ring->prev, &ring->neLxt, work->sendbuf_PRf, work->recvbuffO, work->redOpArg, 0, TO_work->connIndSex, worIMkPL->connIndex); | ^ E]/NCC/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ L_STE/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrPS/sioll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthzeof(T) In file included from reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(ti:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp 12 warnings generated when compiling for gfx90a. [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60400 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/git_version.cpp In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cppint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:259:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 259 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:259:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 259 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:257:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 257 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | In file included from prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>()./builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:271:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 271 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:271:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 271 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:269:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 269 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, uIn file included from nr/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cppo:l2l: >In file included from (/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h):.11: rIn file included from u/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.hn:(173): ; /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h\ : 670| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | 670 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohrceka(dtIhdrxe.axd)I,d xg.rxo)u,p( ggrroouupp)(,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: 671field 'group' will be initialized after field 'stepSize' | stepSize (670s | t e p S itzied_(t i=d=) ,0 n?t hnrcecaldSsh(mnetmh.rceoamdms.)b,u ftfiSdiIzneBsl[oNcCkC(Lt_hPrReOaTdOI_dSxI.MxP)L,E ]g/rNoCuCpL(_gSrToEuPpS)/,s i z| e ^~~~~~~~~~~o f(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ?/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1201. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1200. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx906. [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/hipcc -fPIC -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -O2 -g -DNDEBUG -parallel-jobs=1 -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/register.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.4.43482 --hip-link --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -lpthread -lrt -ldl -ldrm -ldrm_amdgpu clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] Elapsed time (seconds): 5057.17 /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' [100%] Built target rccl gmake[1]: Leaving directory '/builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build' /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-rocm-6.4.0/redhat-linux-build/CMakeFiles 0 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.2JrNp5 + umask 022 + cd /builddir/build/BUILD + '[' /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64 '!=' / ']' + rm -rf /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64 ++ dirname /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64 + mkdir -p /builddir/build/BUILDROOT + mkdir /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64 + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.4.0 + DESTDIR=/builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64 + /usr/bin/cmake --install redhat-linux-build -- Install configuration: "RelWithDebInfo" -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64/librccl.so.1.0 -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64/librccl.so.1 -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64/librccl.so -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/include/rccl/rccl.h -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/include/rccl/nccl_net.h -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/include/rccl/amd_detail/api_trace.h -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb-op.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb-op.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple-op.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple_2.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/alltoall-8n-0-9kb.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/alltoall-8n-190kb-512kb.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/alltoall-8n-512kb-7mb.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/alltoall-8n-7mb-43mb.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-algorithms/alltoall-8n-9kb-190kb.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64/cmake/rccl/rccl-targets-relwithdebinfo.cmake -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64/cmake/rccl/rccl-config-version.cmake -- Installing: /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/doc/rccl/LICENSE.txt + echo s@/builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64@@ + find /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64 -name '*.so.*.[0-9]' + sed -f br.sed + find /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64 -name '*.so.[0-9]' + sed -f br.sed + sed -f br.sed + find /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64 -name '*.so' + find /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/lib64 -name '*.cmake' + sed -f br.sed + '[' -f /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/doc/rccl/LICENSE.txt ']' + rm /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/doc/rccl/LICENSE.txt + /usr/bin/find-debuginfo -j4 --strict-build-id -m -i --build-id-seed 6.4.0-2.el10 --unique-debug-suffix -6.4.0-2.el10.x86_64 --unique-debug-src-base rccl-6.4.0-2.el10.x86_64 --run-dwz --dwz-low-mem-die-limit 10000000 --dwz-max-die-limit 110000000 -S debugsourcefiles.list /builddir/build/BUILD/rccl-rocm-6.4.0 find-debuginfo: starting Extracting debug info from 1 files DWARF-compressing 1 files dwz: ./usr/lib64/librccl.so.1.0-6.4.0-2.el10.x86_64.debug: Unknown debugging section .debug_str_offsets sepdebugcrcfix: Updated 0 CRC32s, 1 CRC32s did match. Creating .debug symlinks for symlinks to ELF files Copying sources found by 'debugedit -l' to /usr/src/debug/rccl-6.4.0-2.el10.x86_64 find-debuginfo: done + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/brp-strip-static-archive /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + /usr/lib/rpm/redhat/brp-python-rpm-in-distinfo + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j4 + /usr/lib/rpm/redhat/brp-python-hardlink Processing files: rccl-6.4.0-2.el10.x86_64 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.FjIC9z + umask 022 + cd /builddir/build/BUILD + cd rccl-rocm-6.4.0 + LICENSEDIR=/builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/licenses/rccl + export LC_ALL= + LC_ALL= + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/licenses/rccl + cp -pr /builddir/build/BUILD/rccl-rocm-6.4.0/LICENSE.txt /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/licenses/rccl + RPM_EC=0 ++ jobs -p + exit 0 Provides: librccl.so.1()(64bit) rccl = 6.4.0-2.el10 rccl(x86-64) = 6.4.0-2.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: ld-linux-x86-64.so.2()(64bit) ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) libamdhip64.so.6()(64bit) libamdhip64.so.6(hip_4.2)(64bit) libamdhip64.so.6(hip_4.3)(64bit) libamdhip64.so.6(hip_4.5)(64bit) libamdhip64.so.6(hip_5.0)(64bit) libamdhip64.so.6(hip_5.3)(64bit) libamdhip64.so.6(hip_6.0)(64bit) libc.so.6()(64bit) libc.so.6(GLIBC_2.10)(64bit) libc.so.6(GLIBC_2.14)(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.2.5)(64bit) libc.so.6(GLIBC_2.3)(64bit) libc.so.6(GLIBC_2.3.2)(64bit) libc.so.6(GLIBC_2.3.4)(64bit) libc.so.6(GLIBC_2.32)(64bit) libc.so.6(GLIBC_2.33)(64bit) libc.so.6(GLIBC_2.34)(64bit) libc.so.6(GLIBC_2.38)(64bit) libc.so.6(GLIBC_2.4)(64bit) libc.so.6(GLIBC_2.6)(64bit) libc.so.6(GLIBC_ABI_DT_RELR)(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_3.0)(64bit) librocm_smi64.so.1()(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.7)(64bit) libstdc++.so.6(CXXABI_1.3.9)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.18)(64bit) libstdc++.so.6(GLIBCXX_3.4.19)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.22)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.30)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) Processing files: rccl-devel-6.4.0-2.el10.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.RmIfxq + umask 022 + cd /builddir/build/BUILD + cd rccl-rocm-6.4.0 + DOCDIR=/builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/doc/rccl-devel + export LC_ALL= + LC_ALL= + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/doc/rccl-devel + cp -pr /builddir/build/BUILD/rccl-rocm-6.4.0/README.md /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64/usr/share/doc/rccl-devel + RPM_EC=0 ++ jobs -p + exit 0 Provides: cmake(rccl) = 2.22.3 rccl-devel = 6.4.0-2.el10 rccl-devel(x86-64) = 6.4.0-2.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: cmake-filesystem(x86-64) librccl.so.1()(64bit) Processing files: rccl-data-6.4.0-2.el10.noarch Provides: rccl-data = 6.4.0-2.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debugsource-6.4.0-2.el10.x86_64 Provides: rccl-debugsource = 6.4.0-2.el10 rccl-debugsource(x86-64) = 6.4.0-2.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debuginfo-6.4.0-2.el10.x86_64 Provides: debuginfo(build-id) = eef9f7691c203fdda7a4a7c89f2090c2671c9255 librccl.so.1.0-6.4.0-2.el10.x86_64.debug()(64bit) rccl-debuginfo = 6.4.0-2.el10 rccl-debuginfo(x86-64) = 6.4.0-2.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Recommends: rccl-debugsource(x86-64) = 6.4.0-2.el10 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64 Wrote: /builddir/build/RPMS/rccl-debugsource-6.4.0-2.el10.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-devel-6.4.0-2.el10.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-debuginfo-6.4.0-2.el10.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-data-6.4.0-2.el10.noarch.rpm Wrote: /builddir/build/RPMS/rccl-6.4.0-2.el10.x86_64.rpm Executing(%clean): /bin/sh -e /var/tmp/rpm-tmp.VrKd3t + umask 022 + cd /builddir/build/BUILD + cd rccl-rocm-6.4.0 + /usr/bin/rm -rf /builddir/build/BUILDROOT/rccl-6.4.0-2.el10.x86_64 + RPM_EC=0 ++ jobs -p + exit 0 Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.iOI66G + umask 022 + cd /builddir/build/BUILD + rm -rf /builddir/build/BUILD/rccl-rocm-6.4.0-SPECPARTS + rm -rf rccl-rocm-6.4.0 rccl-rocm-6.4.0.gemspec + RPM_EC=0 ++ jobs -p + exit 0 RPM build warnings: bogus date in %changelog: Tue Apr 23 2025 Tom Rix - 6.4.0-2 Finish: rpmbuild rccl-6.4.0-2.el10.src.rpm Finish: build phase for rccl-6.4.0-2.el10.src.rpm INFO: chroot_scan: 3 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/centos-stream+epel-10-x86_64-1746813413.875217/root/var/log/dnf.log /var/lib/mock/centos-stream+epel-10-x86_64-1746813413.875217/root/var/log/dnf.librepo.log /var/lib/mock/centos-stream+epel-10-x86_64-1746813413.875217/root/var/log/dnf.rpm.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/rccl-6.4.0-2.el10.src.rpm) Config(child) 137 minutes 36 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "rccl-debuginfo", "epoch": null, "version": "6.4.0", "release": "2.el10", "arch": "x86_64" }, { "name": "rccl", "epoch": null, "version": "6.4.0", "release": "2.el10", "arch": "src" }, { "name": "rccl-devel", "epoch": null, "version": "6.4.0", "release": "2.el10", "arch": "x86_64" }, { "name": "rccl-debugsource", "epoch": null, "version": "6.4.0", "release": "2.el10", "arch": "x86_64" }, { "name": "rccl", "epoch": null, "version": "6.4.0", "release": "2.el10", "arch": "x86_64" }, { "name": "rccl-data", "epoch": null, "version": "6.4.0", "release": "2.el10", "arch": "noarch" } ] } RPMResults finished